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processes for obtaining such p.ants and 9e^c ZZl anT^ls *" • and 

-n .nfected plants to other plant pathogens and plant pests « T 3 h,9her susce P tib »"y «° infection 

•nsect or funga, carriers or may occur'h^ ° f P ' ant ^ ^neral.y occur, via 

« commercial plants into commercial varieties through brSno Th. f ""* d ^ W " d types re,ated *> 
from thewi.d type gene poo. to a cu.tivar is a tediot proce ! I wh*h7h " eXiSt ' n9 reSiSt3nCe in the wi,d 
first be identified in a source (donor) plant species andThen J ^s.stance conferring gene(s) must 
nety. Resistance or tolerance generated I ZTayTwZ^Tr T ^ BeTO P °°' ° f 3 com ™rcia. va- 
of the virus in question . One disadvantage o ^ ° n ' y a9ainst one or * °est a few strains 

s that there is often a lack of a gene source su tab!e for ™ * Stance to a particular virus species is 

Other approaches to HmiUhe effec of virul induced I dteaT? T W ' thin the crop 

as insecticides, fungicides and the like wS Z aaS P * inC ' Ude the USe 0f ch emicals such 

ventative methods such as efficient °" the emp,oy ™' " P- 

bat vrus disease by killing the carrier is m^^^^t^'^ 0 ^' ^ ^ ° f Chem,C3,S to com ' 

> growers with a decreasing scala of permitted clJ^"^^^^^ ~nt 

a second related virus strain. The cross p^tecS n nl w «TT. lh * P ' 3nt 3931081 s ^^n with 
strains to infect p.ants , which act to inhibit a s ^ '"^ the USe of aviru,ent ™us 

ever, the use of a natura. cross-protection system can Lv^ "f£ ' "T^ ^ ° f the Same virus " How " 
intensive because it requires inoculation of eve™ Scron and « "-thcd is very .abour 
mutate to a virulent strain, thus becoming ^ s T a a^Z'r 5 * ^ ^ that 30 avirulent strai " ™V 
is that an avirulent virus strain in one plant speoS caTa* L T 7*** ^ AfUrther pOSsib,e ha2a * 

Several studies have indicated tha' the Soat omit Z t i * " Wn h an ° ther plant specie «- 
cross-protection and that protection oo^^^^Z^^T^ *** *" imp ° rtant ro,e in 
or closely related coat protein structures WU8 a " d the challe "9ing virus have the same 

meth^^ have given rise to new 

res,stance which phenotypical.y resembles natuS. « SSLSTZv "T^*' 0 " " 3 for ^ of virus 
of genetic information of a viral coat protein from^S™ n ?l ' . ,S n ach,eved throu 9h the expression 
of virus resistance via genetic engineering has teen h f 3 9enet,C3 " y manipulated plant . Generation 

Abel et a. [(1986) Science ^T^ISTS Z shown^ " mSt3nCe ' EP 223 452 and re P°^° by 
(TMV-U1) coat protein gene from the a^Z: I™ LI™" 3 "* 8 ' 0 " ° f the tobacc ° mosaic virus strain U1 
after infection with any TM V strain SMa^ J^h reSU ' ted 3 de ' 3y ° f Symptom development 

been obtained for alfa.fa mosaic vi usTa^^^ have a.so 

Although TMV, CMV. AMV and PVX belonc ito dZZf • } *"* m ° Saic virus < CMV )- 

in a., such viruses the viral RNA is a ^ i T^ 'Z" T^A ^ ^ 3 C ° mmon architecture: 
dividual but identical viral coat proteins encaps.dated by a viral coat consisting of many in- 

spo^X^ the P ,ant viruses mentioned above. The genus tc- 

sphenca. in shape (80-120 nm h^S^STZ^: ^ thn ' PS - The V ' Vus P^«« a - 

studded with glycoprotein surface p W jJ^ 1 ^,£^r^ W " to SUrrounded a «P« envelope 
molecules of negative or ambisense i^j^^SS^S^ ° f " ne3r sin 9 |e stranded ™* 

«ed by a consensus sequence as fol loU 5' 3^ ° f ? 636 RNAmo '^'es are character- 

U . Members of the tospovirus group Zude iSIS^S""* ' ■ GAUUGCU CU 3', wherein X is C or 
(INSV), and tomato chlorotic spot vwZTcsT^L ^ ^ (TSWV) ' ,mpatiens necro «° «P« virus 
like isolate BR-C3. A genera. ^L^^^^Z^^ ^ ^ ^ ( ™ SV) " TSWV " 
v-ruses can be found in our co-pending *HtkJ^£?3l^ * ° f the genus tos P°" 

The tospovirus particle contains at laSJSLrt 2? 1 . incorporated by reference . 
of 28 kd and two membrane JESS'S kTaTd^" interna ' nUC,eOC3pSid protei " N 

5000 nt. (M R NA ) and about S 9 00 9 nt(^^ 
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copies of the L protein to form circular nucleocapsids. A schematic structure outlining most properties of an 
Ssvl given in Figure 1. Based on the above and other properties , INSV (like TSWV) has been classed 
as a member of the tospovirus genus. . 

Circumstantial evidence has been presented which suggests that an M RNA encoded I gene is directiy or 
indirectly involved in the synthesis of the G1 membrane glycoprotein [Verkleij and Peters ,(1 983) J. Gen. V.rol. 

64:6 II mentioned above, tospoviruses such as TSWV , INSV and the like are transmitted by certain species 
of thrips. These tospovirus carriers belong to the family Tripidae and include tobacco thf (P«;«™^ 
(Hinds )). western flower thrips (F. occidental!* (Pergande)), common blossom thrips (F. Schultzei (Trybom)), 
chilli thrips {Scirtothrips dorsalis (Hood)). Thrips setosus (Moulton), onion thrips (T. tabaa (L.ndeman)) F. m- 
tonsa and melon thrips (T. palmi (Karny)). The tospovirus is acquired by thrips only during their larval stagey 
Larvae can transmit the virus before they pupate but adults more commonly transmrt the v.rus. Adult thnps 
can remain infective throughout their lives . 

Tospoviruses are widespread in temperate, subtropical and tropical climate zones throughout the world . 
The current distribution of tospoviruses covers all continents and makes them one of the most w.dely d.stnb- 
uted of groups of plant viruses. At least 370 plant species representing 50 plant families, both monocotyledons 
and dicotyledons , are naturally infected by tospoviruses of the Bunyaviridae . Tospoviruses seriously affect 
the production of food and ornamental crops . Symptoms of tospovirus infection in plants include stunting, ring- 
spots dark purple-brown sunken spots, stem browning, flower breaking, necrotic and p.gmental tenons and 
patterns yeLs and non-necrotic mottle, mosaic in greens or even total plant death. Most plant hosts ; display 
only a few of these symptoms, however, the wide range of symptoms produced by tospovirus infection has 
complicated diagnosis of the disease and has led to individual diseases being given several different names. 
Afurther complication is that tospovirus symptoms within the same plant species may vary depending on the 
age of the plant, time of infection during the life-cycle of the plant . nutritional levels . env.ronmental conditions, 
such as temperature , and the like. 

Although TSWV has been known for many years, is widely distributed, and is the causal agent of a disease 
which leads to significant loss in yield in crops and ornamentals, limited progress has been made ,n .dent, ying 
sources of genes capable of conferring resistance to TSWV or other tospov.ruses . Amonogemc TSWV ^toler- 
ance has been identified in Lycopersicon peruvianum, but this trait has not been transferred to cu.hvated tom- 
atoes so far . nor has a resistance source been identified for other crop spec.es . The use of natural cross- 
protection systems to decrease the invasive effects by tospovirus strains capable of causing damage is not 
well documented. Limited positive results have been reported for tomato and lettuce. 

The introduction of genetic information capable of conferring resistance or tolerance to tospoviruses into 
plant gene pools by means of genetic manipulation provides the breeder and grower alike w. h a nev, r method 
forcombatting tospovirus induced disease. In particular, ithas bee n found that genetic manipulation techn.ques 
may be employed to confer resistance to INSV related disease in plants . 

Detailed Description 

According to the present invention there is provided a recombinant INSV DNAconstruct comprising a DNA 
sequence coding for transcription into 

a) an RNA sequence of an INSV or an RNA sequence homologous thereto ; frtPanl K,e V 
b an RNAsequence of an INSV or an RNA sequence homologous thereto capable of encod.ng for an INSV 
protein or a part thereof . in which one or more codons have been replaced by synonyms . or an RNA se- 
quence homologous thereto ; or 

c) an RNA sequence complementary to an RNA sequence according to a) orb). 

which INSV DNA is under expression control of a promoter capable of functioning in plants and includes 
a terminator capable of functioning in plants. 

The DNA sequences defined under a), b) and c) above, for the purposes of the present invention will be 
referred to as 'INSV Related DNA Sequences* hereinafter. An INSV Related DNA Sequence according , to .the 
invention may be modrfied as appropriate to create mutants or modified sequences homotogous to such INSV 
Related DNA Sequences from which they are derived, using methods known to those skilled m the art such 
as site-directed mutagenesis and the like. Such mutants or modified coding sequences are embraced wrthm 

1 h6 S?t n-ofan INSV ■ may refer to a sequence of the S, M or L RNA strand, preferably 

an S or M RNA strand, more preferably to an S RNA strand of an INSV. 

The term " RNA sequence homologous to an RNA sequence of an INSV ■ refers to an RNA sequence of 
an INSV wherein a number of nucleotides have been deleted and/or added but which is still capable of hybnd- 
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s y s,r:t:^^rs« rc^^ri'r^r 8 "' invemion mav inv * e toba,i °" *> * «-«» 

* u . » muu«» ai u ana washing 3x in a buffer compr s nq 0 1 x SSC and 0 1% t?n<j of «op 

bacterTa^^TTH 8 "" 3 ? 6 ^ ° NA C ° nStrUCtS ° f the P resent inv *"«°" '"dude viral fungal 

Lirrmed^rTr^ 6 " 1 h Pr ° dUCe am ° Unt ° f RNA ^^^^^^S^T. 
fransformed plant. The requ.red amount of RNA to be transcribed may vary with the type of plant Par culiri! 
preferred promoters include the cauliflower mosaic vim* tr a u\i " P Part| c"'arly 

if it i^hi^of anCe Pr8Sent inVenti ° n ' a " ^sequence is complementary to another RNA sequence 

«. z^n^c™ ::rr pabte - ln,^oduc, " 9 ,he dna — - - — 
ir.s «: : p ,ni ° ™"" e p,ants ■ pro9eny ana >-* th ~- «* - . s« zt 

m»IHll i !lr , *5 n . Pr "" deS Pla " 15 in «■* senome a DNA construct of the invention end 

nil.tlS'^S, 30 ?!!!"' 9 10 ' h * have ret "" !ea 8US «P«M»1' to "Wee induced by INSV or diseases 

not lfm U HeVtoToll a ri nC,Ude l he ° r A namental ^flowering plants. Examples of such plants include butare 
v2 Si Ageratum . Amaranthus , Anthirrhinum , Aquilegia . Begonia , Chrysanthemum Cineraria do! 
ver, Cosmos , cowpea , Cydamen , Dahlia . Datura , Delphinium , Gerbera , Gladiolus G ToxSa HiL'ett 
fr ™ ' ' mpatienS ' M-mbryanthemum • P-*"* , Primula . Saint Pau.ia . Salpigfots ,' SgeTeL /verbena 
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Viola , Vinca , Zinnia , Pelargonium and the like . 

Other types of plants may be susceptible to INSV infection but these plants may not present disease symp- 
toms directly associated with INSV infection, however such plants may present symptoms of a disease as a 
s result of a secondary infection by a different organism made possible as a result of an initial infection by INSV. 
Such plants may therefore be viewed as being the subject of an INSV infection related disease and may include 
plants selected from a wider group of plant types. Further examples of this group of plant types may include 
vegetable and other crops . Such crop types include alfalfa , aubergine , beet , broad bean , broccoli . brussels 
sprouts , cabbage , cauliflower . celery , chicory , cow pea , cucumber , endive , gourd , groundnut , lettuce , 
10 melon , onion . papaya , pea , peanut . pepper , pineapple , potato , saff lower , snap bean . soybean , spinach , 
squash , sugarbeet , sunflower , tobacco , tomato , water melon and the like . 

The invention relates in particular to ornamental plants and preferably to those listed ornamental plants 
comprising in their plant genome a DNA construct of the invention. 

The particular features of tospoviruses including those of INSV are illustrated hereinafter. 
15 The S, M and L RNA are single stranded RNA molecules. The S RNA of INSV is about 3000 nucleotides 

long( SEQ. ID No.1 ; SEQ ID No. 2 ) and comprises two genes, one (SEQ ID No.3) encoding a non-structural 
protein (NSs) in viral sense, the other one (SEQ ID No.11) encoding the nucleocapsid protein (N) in viral com- 
plementary sense. The intergenic region between the NSs- and N-gene can be folded into a secondary struc- 
ture (Seq ID No. 7 and SEQ ID No.8) . The 5*- and 3'-terminal sequences of the S RNA are capable of hybnd- 
20 izing to each other such that the first nucleotide is opposite (and complementary) to the last nucleotide of said 
S RNA strand . For the purposes of the description the double-stranded structure obtained by hybridizing both 
RNA termini will be referred to as a "pan-handle" (SEQ ID No.5 and SEQ ID NO. 6) hereinafter. 

The M RNA strand of INSV comprises about 5000 nucleotides (SEQ ID No. 14) . It contains at least two 
open reading frames, one encoding a non-structural protein (NSm) in viral sense ( SEQ ID No.1 5), and another 
25 open reading frame (SEQ ID No.21) in viral complementary sense. This open reading frame is translated on 
polysomes located on the endoplasmic reticulum where the nascent polypeptide chain is cleaved co-transla- 
tionally to form the spike proteins G1 and G2 respectively. As with S RNA, the termini of the M RNA strand 
are complementary to each other and may likewise hybridize to form a " pan-handle " ( SEQ ID No. 1 8 and SEQ 

ID No.19). _, . _, . 

30 The L RNA strand of INSV comprises about 8900 nucleotides. It contains complementary 3 and 5 ends 

for a length of from about 50 to about 80 nucleotides. The RNA has a negative polarity, with one open reading 
frame (ORF) located as the viral complementary strand. This ORF corresponds to a primary translation product 
of about 2875 amino acids in length with an anticipated Mw of between about 300.000 to about 350.000. Com- 
parison with the polymerase proteins of other negative strand viruses indicates that this protein probably rep- 
35 resents a viral polymerase. In some mutant strains, shortened L RNA molecules have been found in addition 
to the wild type, full length L RNA. These shortened L RNAs however are observed to possess the character- 
istic terminal nucleotide sequences and thus are capable of forming "pan handle" structures. They are also 
encapsidated with nucleocapsid protein and are included in virus particles. Their presence suppresses symp- 
tom development resulting in less severe detrimental effect . Thus , these shortened L RNA molecules can be 
40 regarded as defective interfering (Dl) RNAs . A defective interfering RNA is one which is capable of interfenng 
in replication by competing with other genomic RNAs for polymerases and therefore is capable of being r pli- 
cated , and by so doing inhibits the replication and/or expression of other genomic RNAS with which it is com- 
peting Thus . a Dl RNA may comprise any RNA sequence which is capable of being replicated and may be 
an L S or M RNA within the context of the present invention . Such Dl RNA sequences may comprise RNA 
45 sequences which have had nucleotides either deleted from or added thereto provided that they are capable 
of competing for polymerases and of replicating . 

A preferred embodiment of the invention relates to DNA constructs of the invention coding for transcription 
into INSV RNA sequences of a "pan-handle" (SEQ ID No.5 . SEQ ID No.6 ; SEQ ID No.18 . SEQ ID No.19), 
or into INSV RNA sequences homologous thereto, 
so Another preferred embodiment of the invention relates to DNA constructs of the invention coding for tran- 

scription into INSV-RNAsequences of an open reading frame in viral complementary sense i.e. having negative 
polarity . or into corresponding RNA sequences in which one or more codons have been replaced by their syn- 
onyms, or into RNA sequences homologous threto. 

Afurther preferred embodiment of the invention relates to DNA constructs of the invention coding fortran- 
55 scription into INSV-RNAsequences of a hairpin ( SEQ ID No.7 . SEQ ID No.8 ; SEQ ID No.13 . SEQ ID No.16) . 
or into RNA sequences homologous th reto. 

Preferably, the INSV-RNA sequence referred to hereinabove has at least 20 nucleotides. Pr ferably, the 
INSV-RNA sequence has at least 50 nucleotides. 

Examples of DNA constructs suitable for use according to the invention include INSV-Related DNA Se- 
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quences coding for transcription into (reference is made to the sequence listing V 
0 the viral S RNA nucleotide sequence from 1 to 3017 (SEQ ID No 1) 
VO? V,ral ,^ RNA ™cleotide sequence from position 25 to 3017 (SEQ ID No 2Y 
... the viral S RNA nucleotide sequence from 87 to 1436 (SEQ ID No 3 

iv) the viral S RNA nucleotide sequence from 2080 to 2868 (SEQ ID No 4V 

v) the viral S RNA "pan-handle -structure comprising- ' 

vl a , f s rS RNA C,e ° tide S6qUenCe ° f fr ° m 3b0Ut 30 t0 3b0Ut 36 nudeotides 'njength from the 5" end of the 

and 

l^ira?s d RNA ,e0tide SeqUenCe " ^ 3b ° Ut 3 ° * 3b ° Ut 36 nud ~«~ from the 3' end of 

vi) the viral S RNA nucleotide sequence from 1437 to 2079; (SEQ ID No 7) 
vn) the viral S RNA nucleotide sequence from 1440 to 2041; (SEQ ID No 8 

Z °.^ e com P |ementar y S RNA nucleotide sequence from 1 to about' 3017; (SEQ .D No 9) 

5 t£ JE? 00m ? amanlw y S RNA nuc,eotide -quene. from 1 to 2993; (SEQ D No 10) 

x i th e 4 RuT' ? mentary S RNA nUC,60tide Sequence from 150 to 938; (SEQ D No 11 ) 

xi) the s RNA nucleotide sequence from 1581 to 2930 of the vira, complemen^ry S RN^strand; (SEQ ID 

syno S ny^ nUC,e ° tide 87 to ™ «" which one ormore codons have been replaced by their 

t^sylZT^ ^ 2 ° 8 ° * 2868 WhiCh ° ne <* ™ re «*>™ *™ -P-aced by 

xv) the M RNA nucleotide sequence from 1 to 4970 (SEQ ID No 14)- 

xvi) the M RNA sequence from 86 to 997 (SEQ ID No 15)- 

r ;v 470 (SEQ ,D no16): 

and 

v b L a ,M RNA nUde ° tide S6qUenCe ° f fr0m ab ° Ut 30 1 ° 3bOUt 36 nucleotides * -9th from the * end of the 
xx) the complementary viral M RNA sequence from 1 to 4970; (SEQ ID No 20) 

T a z e ^z::z t^vtssr^ from position 87 to posiuon 3500 ° f the 

SSf ° mp,ement *y viral M sequence from position 3974 to 4885 (SEQ ID No 22) 
= Asequenceshomo«o g oustothenuc,eotide S e^ 

xxiv) fragments of sequences defined under i) to xxii) hereinabove. 

ments th^boi^^iSSJSS^ ^ ° A sequences homologous thereto, or intofrag- 
at least 50 nucleotides "ucleoudes. more preferably at least 20 nucleotides, and most preferably 

INSVR^^^^ in r nU0 " th h 6 DNA -^ts of the invention comprise 

<.e -pan-handles) of vfr^JTSSS^ 

S RNA . Examples of S RNA and M RNA terminal i^ZSSS ^ ^ ^ ° f 

.) a f rat nucleotide sequence 36 nucleotides in length from the 5' end of the viral S RNA : 

5 ' AGAGCAATNN NNNNNNNNNN NNNNGAACAAC CCAAGC 3 ' 
(SEQ IDNo.5 ie nucleotides from position 1 to 36 of SEQ ID No.1 . where N stands for A.T.G.or C) 
a second nucleotide sequence 36 nucleotides in length from the 3' end of the viral S RNA: 
5 ' GATTATATG ATGTTATATT CGTGACACAA TTGCTCT 3 ■ 
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(SEQ ID No.6 ie nucleotides from position 2981 to 3017 of SEQ ID No.1) 
ii) a first nucleotide sequence of 36 nucleotides in length from the 5' end of the viral M RNA : 

5 ' AG AG C AATC A GTGCATCAAA ATTATATCTA GCCGAA 3 ' 

5 

(SEQ ID No. 18 ie nucleotides from position 1 to 36 of SEQ ID No.13) 
and 

b) a second nucleotide sequence 36 nucleotides in length from the 3' end of the viral M RNA 
10 5 ' TGTTGTATGT AGAGATTTTG TTTGCACTGA TTGCTC T 3 ' 

(SEQ ID No. 19 ie nucleotides from position 4941 to 4970 of SEQ ID No. 13) 
In the case of the terminus at the 5' end of the S RNA it is not known whether or not there are sixteen or 
15 seventeen nucleotides in the unknown region demarked by a series of " N " s , however the exact number of 
nucleotides in this region is not considered to be critical to the formation of " pan-handle "structures so long 
as the 5' end of the S RNA is capable of complementing the 3' end of the S RNA thus enabling the formation 
of a "pan-handle" structure . 

The invention further provides probes suitable for use as diagnostic tools for the diagnosis of disease in 
20 plants suspected of being infected with INSV tospoviruses. Such probes comprise a labeled oligonucleotide 
(RNA or DNA) sequence complementary to an RNA sequence of an INSV tospovirus. The desired length of 
the sequence and appropriate method for diagnostic use of probes are known by those skilled in the art. A 
suitable probe may comprise a nucleotide sequence of at least 12 to about 800 nucleotides, preferably at least 
15, more preferably more than 30 nucleotides, and most preferably from about 400 to 600 nucleotides com- 
25 plementary to an RNA sequence of an INSV tospovirus . 

Probes according to the invention are helpful in identifying INSV tospovirus RNAor parts thereof in infected 
plant material i.a. for diagnostic purposes prior to full presentation of disease symptoms in plants . 

The invention accordingly also provides a diagnostic method of determining INSV tospovirus infection in 
plants which comprises detecting INSV tospovirus replicative forms employing the probes of the invention in 
30 dot-blot type assays. 

Probes according to the invention are useful in the construction of and use of chimeric genes comprising 
a DNA sequence corresponding to an RNA sequence of an INSV tospovirus. 

The DNA constructs of the invention may be obtained by insertion of an INSV Related DNA Sequence in 
an appropriate expression vector, such that the sequence is brought under expression control of a promoter 
35 capable of functioning in plants and its transcription is terminated by a terminator capable of functioning in 
plants. 

The term "appropriate expression vector" as used herein refers to a vector containing a promoter region 
and a terminator region which are capable of functioning in plant cells. 

The insertion of an INSV Related DNA Sequence into an appropriate expression vector may be carried 
40 out in a manner known per se. Suitable procedures are illustrated in the examples hereinafter. 

Likewise the construction of an appropriate expression vector may be carried out in a manner known per 

se. 

Plants according to the invention may be obtained by 

a) inserting into the genome of a plant cell a DNA construct as hereinbefore def ined; 
45 b) obtaining transformed cells; and 

c) regenerating from the transformed cells genetically transformed plants. 

DNA vectors of the present invention may be inserted into the plant genome of plants susceptible to INSV 
infection. Such plant transformation may be carried out employing techniques known per se for the transfor- 
mation of plants, such as plant transformation techniques involving Ti plasmids derived from Agrobacterium 
so tumefaciens. A. rhizogenes or modifications thereof, naked DNA transformation or electroporation of isolated 
plant cells or organized plant structures, the use of micro-projectiles to deliver DNA, the use of laser systems, 
liposomes, or viruses or pollen as transformation vectors and the like. 

Plants of the inv ntton may be monitored for expression of an INSV-Related DNA Sequence by methods 
known in the art, including Northern analysis, Southern analysis, PCR techniques and/or immunological tech- 
55 niques and the like. The plants of the invention show decreased susceptibility to INSV infection as demon- 
strated by tests whereby the plants are exposed to INSV preferentially at a concentration in the range at which 
the rate of disease symptoms correlates linearly with INSV concentration in the inoculum. 

Methods suitable for INSV inoculation are known in the art and include mechanical inoculation, and in par- 
ticular, the use of appropriate vectors. 
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Fi^ ^ f ° ,IOWing non - |imitin 9 ^ampies and accompanying figures 

Hgure 1. Schematic representation of an INSV particle . figures. 

Figure 2: Sequence strategy for INSV viral S RNA 

Figure 4: Schematic review of the construction of a suitable expression vector (oZU m 
seance' 5: SCh6matiC — - —ion of a Suitab ,e p.asmid comp^SlS N P rotein-co ding 

coding^uerfc:" 6 " 13 " 0 ^ ° f C ° nStrUCti ° n * 3 SUitaWe " asmid c ™P™ng the INSV NSs protein- 
codin? S U e e J; n S c C e hemat,C ° f C ° nStrUCti ° n ° f 3 SUitaWe " asmid c ° m P-"9 the .NSV NSm protein- 

prote^^cl^ing " ^ C ° nStrUCt ''° n * * ^ ^ *• .NSV G1/C2 glyco . 

Figure 9: Schematic review of the construction of a INSV N oenp rnnteinm« «i- * * 
^ F*» 10: Schema. re*.„ ofthe c„„ . ^KSSCSSSS^ 

p, a nfs::^r.s„r Iew °* ,he c ° nstruc "° n * a iNsvGi ' G2 *»p-o'-"^~ re or S e„ e .c„„ BWng 

to Figure 12: Schematic revie» of ,h. construction of a INSV NSm gene-containlng p, a „, transformation vec- 
S«h| 13: f* oond * r !' 8l ™<*"« located at the intergenfc region of INSV S RNA. 

goences ^ for the SSZSE I ^SSST." 1 : 

sequences. a se Q ue nce homologous to such 

Other advantageous features of the present invention will be apparent from the foNowing examp.es. 
MATERIAL AND METHODS 

stanSro^^^^^ <■> >** formation studes, are gr™ u „ oer 

strains are grown at 28»C in m^a m , standard LB-medium. Agrobacterium tumefaciens 

^a~a~ 

reclpLirSStXm,^' C °" ** ^ ^ * usee- a S a 

-jsz r^r ^™™r ( ,r^o'™^ -r — - «» * 

CO//HB101. containing the olasmid omaol^Il 303.179-180] m standard tnparental matings using the E. 
Sci USA 76164ft IJSn a • PRK2013 as a helperstrain. [Figurski and Helinski. (1979) Proc NaU Acad 

amplification using the polymerase chain reaction (PCFO were o rfr,rm»H 
supplier of the Tao polymeras (Perkin Elmer Cetus ) P * 38 r commended b V «* 

Amplications of RNA by reverse transcript of the target RNA fol.owed by standard DNA amplication 
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were performed using the Gene Amp RNA PCR Kit as recommended by the supplier ( Perkin Elmer Cetus ). 
Examples 

Example 1: Isolation of INSV particles and genetic material therein 

INSV isolate NL-07, an isolate from Impatiens, is maintained on Impatiens by grafting. Virus is purified 
from systemically infected Nicotiana rustica leaves, after mechanical inoculation essentially as described by 
Tas et al. [(1977) J. Gen. Virol. 36:81-91 ] . All material used in the isolation procedure should be maintained 
at a temperatue of 4 °C . Twelve days after inoculation 100 grams of infected leaves are harvested and ground 
for 5 - 10 seconds at a low speed setting in 5 volumes extraction buffer (0.1 M NaH 2 P0 4 , 0.01 M Na 2 S0 3 , pH 
7) in a Waring blender. The suspension is filtered through cheesecloth and the filtrate is centrifuged for 10 
minutes at 16,000 x g. The resulting pellet is resuspended in three volumes resuspension buffer (0.01 M 
NaH 2 P0 4 , 0.01 M Na 2 S0 3 , pH 7). The pellet is dissolved by stirring carefully at 4°C. After centrifuging for 10 
minutes at 12,500 x g the pellet is discarded and the supernatant centrifuged again for 20 minutes at 50,000 
x g. The pellet is resuspended in 0.2 volume of resuspension buffer (0.01 M NaH 2 P0 4 , 0.01 M Na 2 S0 3 , pH 7) 
and kept on ice for 30 minutes. Anti-serum raised in rabbits against material from non-infected Nicotiana rustica 
is added to the solution and carefully stirred for 1 hour . Non-viral complexes are pelleted after 10 minutes cen- 
trifuging at 16,000 x g. The cleared supernatant is loaded on a linear 5% - 40 % sucrose gradient in resuspen- 
sion buffer(0.01 M NaH 2 P0 4 . 0.01 M Na 2 S0 3 , pH 7) , and spun for 45 minutes at 95,000 x g. The opalescent 
band containing INSV particles is carefully collected with a syringe and diluted 4 times with resuspension buf- 
fer. Washed viruses are pelleted by centrifugation for 1 .5 hours at 21 ,000 x g and resuspended in one volume 
of resuspension buffer. Generally, 100 grams of leaf material yields approximately 0,5 mg of INSV viruses. 
INSV RNA is recovered preferentially from purified virus preparations by SDS-phenol extractions followed by 
ethanoi precipitation . From 1 mg INSV , 1-5 ug of RNA is extracted. The isolated RNA molecules are analysed 
for intactness by electrophoresis on an agarose gel. Three distinct RNA molecules are identified with apparent 
sizes of about 3000 nucleotides (S RNA), about 4900 nucleotides (M RNA) and about 8900 nucleotides (L RNA) 
respectively. 

Example 2: Sequence determination of the 3'-termini of the INSV viral RNAs 

In order to perform direct RNA sequencing , INSV RNA is extracted from purified nucleocapsids ess ntially 
according to Verkieij et al. (1983) supra . Twelve days after inoculation 100 grams of infected leaves are har- 
vested and ground for 5 - 10 seconds at a low speed setting in four volumes of TAS-E buffer (0.01 M EDTA, 
0.01 M NazSOa 0.1 % cysteine, 0.1 M TRIS pH 8.0) in a Waring blender. The suspension is filtered through 
cheesecloth and centrifuged for 10 minutes at 1,100 x g. Nucleocapsids are recovered from the supernatant 
after 30 minutes of centrifuging at 66,000 x g. The pellet is carefully resuspended in one volume of TAS-R buffer 
(1 % Nonidet NP-40, 0.01 M EDTA, 0.01 M Ha^SO^, 0.1 % cysteine, 0.01 M glycine, 0.01 M TRIS , pH 7.9). 
The pellet is dissolved by stirring carefully for 30 minutes at 4 °C. The supernatant is cleared by centrifuging 
for 1 0 minutes at 1 6,000 x g. Crude nucleocapsids are collected from the cleared supernatant by sedimentation 
through a 30 % sucrose cushion for 1 hour at 105,000 x g. The nucleocapsid pellet is resuspended in 400 ul 
0 01 M Na-citrate pH 6.5, layered on a 20 - 40 % sucrose (in 0.01 M Na-citrate pH 6.5) and spun for 2 hours 
at 280,000 x g. The three different opalescent bands, respectively U M and S nucleocapsid. are collected sep- 
arately. INSV RNA is recovered preferentially from purified nucleocapsid preparations by SDS-phenol extrac- 
tions followed by ethanoi precipitation. Generally , 100 ug of RNA are obtained from 100 grams of infected 
leaves. The 3'-ends of the separate INSV RNAs are labeled using RNAIigase and 5 f -pP]pCp. The end-labeled 
RNA molecules are separated on a low gelling temperature agarose gel [Wiestander, (1979) Anal Biochem 98: 
305-309] . The enzymatic approach described by Clerx-Van Haaster and Bishop [(1980) Virology 105:564-574] 
and Clerx-Van Haaster et al. [(1982) J Gen Virol 61:289-292] is used to determine the 30 terminal nucleotides 
of the 3'-and 5'-ends of both S and M RNA. 

Synthetic oligonucleotides complementary to the 3'-termini are synthesized using a commercially avail- 
able system (Applied Biosystems) and used for dideoxy-sequencing with reverse transcriptase. 

Example 3: cDNA cl ning f INSV genetic material 

Oligonucleotides complementary to the 3'-end of the S RNA are used for priming f irst strand cDNA syn- 
thesis. With th se primers, double stranded DNAto INSV RNA is synthesized according to Gubler and Hoffman 
[(1983) Gene 25:263-269]. 
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b, firs, an. second 8 Jnd cDNA ^.hesb made Z » 75'° '"i** 1 CS "'" ,> ™ S ° NA - '°"°»<* 

with T4 llgase into ,he Smal site oTpUC19 »»«-««tod us ,„g T4-DNA polymerase and llgated 

fragments are cloned into the Sma I site of pUC1 9 As " B ' Unt ended cDNA 

labeled, random.y primed first strand cDNAas ^orobe Set, o^ ' ^ 72:3961 " 3965 ] ^"9 PP- 

ern analysis followed by plasm W «XL ?^ aPP ' n8 cDNAdones are ««ected by South- 

quences of the S RNA (Figure^ * reStr ' Cti ° n map ' based on cDNA de ™ed s - 

Example 4: Sequence determination of the INSVS RNA 

SoS™ 

nucleotide INSV-S60 (5' d(AGAGCAA^GTGTCA^wh^^^ * pnmer extension °< the synthetic o.igo- 

minus. Sequence data SSS^r^T? ,yto the 15 nucleotides of the 3'-ter- 
to SEQ ID No. 12) ( nt) 13 summan2ed ^ the sequence listing (SEQ ID No.1 

reading frame is found starting at position 87 an nZZS^S^T"* } °" ^ a " ° pen 

a protein of 449 amino acids J!th ii^^S^^^ 9 ?^ ^ ? 1436 
protein, tentatively designated NSs (Figure 3/ SEQ ID No 261 The oth.r 1 P T * non-structural 

viral complementary strand from oositio'n 2W^^^^%^"^ * ,OCat , ed on tne 
peptide with a predicted molecular mass of about 28 7 kri Thic „ ' ® nC0dm9 a 262 am,no acid 'ong poly- 
capsid protein N (Figure 3/ SEQ ID ^No S^ Th.i r * I read ' n9 frame encodes the viral ""cJeo- 

complementary sUnd ToflNSV S RNA fndica^nq theTssfn^ > ^ ° f the ^ a " d the viral 

mRNAs (SEQ ID No 3 SEQ ID No 11 ra<!ni»r«!!Li f ^n! *u p . e,n 9 enes are expressed from subgenomic 
ambisense gene arrangementThi^Z^^ that a plant virus RNA has an 

mentary terminal repeats capable of forrning **S*?nn nLnH.^ !^ ^ ^ 6XiStenCe ofcom ^ 

- r s r „o5r^^ 

Example 5: Construction of an expression vector pZU-B 
or .he gene ,or iLJ^SrStT'C" e ° nS '" ,C ° n9 *— 
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Example 6: Subcl ning of the INSV N prot in gene 

The INSV N protein coding sequence is obtained by fusion of the cDNA clones plNSV-S60 and pINSV- 
S39 ( Figur 5), The cDNA clone plNSV-S60 is subjected to Spel digestion and the fragment containing the 
3'-end of the INSV N protein g ne is separated electrophoretically and purified from the gel using a DEAE mem- 
brane (NA-45 Schleicher and Schull) and cloned in the largest Spel fragment of plNSV-S39 linearized resulting 
in the recombinant plasmid pINSV-N. Primers are designed homologous to the translational start and stop co- 
don Primer INSV-066 d(GCAGATATCATGAACAAAGC) creates an EcoRV site just proximal to the start codon. 

Primer INSV-070 d(GCAACCTGCAGCTCAAATCTCTT) creates a Pstl site just distal to the stop codon. 
These primers are used in standard PCR experiments in which pINSV-N is used as the template. The resulting 
PCR fragment is isolated from the gel using a DEAE membrane (NA-45, Schleicher and Schull) and cloned in 
the Smal linearized pBluescript to generate plasmid plNSV-N2 . The added restriction sites. EcoRV and Pstl, 
facilitate the construction of further plasmids. (Alternatively, one may choose to add the sites in different ways 
such as but not limited to site-directed mutagenesis or by ligation of other synthetic oligonucleotide link rs. 
Such methods are all known to a person skilled in the art) 

Example 7: Subcloning of the INSV non-structural protein genes (NSs gene) of INSV S RNA 

The sequence of the gene corresponding to the non-structural protein NSs is isolated using RNA based 
PCR on isolated INSV S RNA. Two primers are designed which are homologous to regions spanning either 
the translational start codon or stop codon . The start codon primer contains an EcoRV site proximal to the 
ATG codon , the stop codon primer has a Pstl site just distal thereto . Purified INSV S RNA is subjected to the 
Gene AMP RNA PCR. The resulting PCR fragment is isolated from the gel and cloned into Smal Imeanzed 
pBluescript yielding the recombinant plasmid pINSV-NSs (Figure 6). 

Example 8: Subcloning of the INSV non-structural protein gene (NSm gene) of the INSV M RNA 

The sequence of the gene corresponding to the non-structural protein NSm is isolated using RNA based 
PCR on isolated INSV M RNA. Two primers are designed which are homologous to regions spanning either 
the translational start codon or stop codon . The start codon primer contains an EcoRV site prox.mal to the 
ATG codon. the stop codon primer has a Pstl site just distal thereto . Purified INSV S RNA is subjected to the 
Gene AMP RNA PCR. The resulting PCR fragment is isolated from the gel and cloned into Smal Imeanzed 
pBluescript yielding the recombinant plasmid pINSV-NSm (Figure 7). 

Example 9: Subcloning of the INSV G1/G2 glycoprotein gene (G1/G2 gene) of the INSV M RNA 

The sequence of the gene corresponding to the G1/G2 glycoprotein precursor is isolated using RNA based 
PCR on isolated INSV M RNA. Two primers are designed homologous to regions spanning either the transla- 
tional start codon or stop codon . The start codon primer contains an EcoRV site proximal to the ATG codon 
the stop codon primer has a Pstl site just distal thereto . Purified INSV M RNA is subjected to the Gene AMP 
RNA PCR. The resulting PCR fragment is isolated from the gel and cloned into Smal linearized pBluescript 
yielding the recombinant plasmid plNSV-G1/G2 (Figure 8). 

Example 10: Construction of plant transformation vectors containing INSV sequences 
Example 10A: N protein constructions in pZU-B 

In order to make a fusion in which the ATG start codon from the N protein gene is fused directly to the 3'- 
end of the TMV untranslated leader of the 35S-C1 promoter the start codon of the N gene has to be j™Jtated 
using the PCR approach as hereinbefore described. The N protein gene is excised from the plasmtd pINSV- 
N2 via an EcoRV-Pstl digestion. The fragment is isolated and inserted into the Smal-Pstl linearised pZU-B, 
resulting in recombinant plasmid pINSV-NB. The chimeric cassette containing the 35S-fi promoter, the Ngene 
and the NOS terminator is excised from the plasmid pINSV-NB via a BamHI/Xbal digestion. The tsolated chh 
maeric gene cassette is then inserted into the BamHI/Xbal linearized pBIN19 to create the binary transfor- 
mation v ctor pINSV-NBB. The resulting plasmid pINSV-NBB (Figure 9) is used in plant transformation experi- 
ments using methods well known to a person skilled in the art. 
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Exampl 10B: NSs protein gene constructions in pZU-B 

«- o™rr,;r^^^ - — » «. 3-- 

proach. Tn. p. MmM p, N sv. Ns is d£ J^^^S'.*^ SST^Z?T; P ° R ^ 

from the gel and inserted into Smal/ p«m !!„.„• w ™ => , containing fragment is isolated 

The chimeric cassette «SnS*.^ ^ Sc^" reCOmbinant «" asmid P'NSV-NSsB. 

is excised from the p.Mm?p35.^£ NS ' ^ "* N ° S terminator 

then inserted into the BamH./XbaninearfzId dbSS to ^XT"" " ^ 9606 C3SSette is 

The resulting plasmid p.NSV-NSs^ 

we., known to a person sSedm the aTt!' > ' P ' ant transfor ™ti°n experiments using methods 

Example 10C: G1/G2 glycoprotein gene constructions in pZU-B 

^^^J^^^^J^ f r T G1/G2 «'^P-e,precursor is fused 
usingthePC R ap P r 0 ach.Thep,asr p r ,^^^^ 

gene ; and the "OS 'erminato^xc^ 

"a.nexperin^ 11) is used in piant 

Example 1 0D: NSm protein gene constructions in pZU-B 

methods well known to a person skilled in the art. transformation expenments using 

Example 10E: 5'- and 3'-termini "pan-handle" constructions in pZU-B 

a r t^^ 

ecu.et'I^r/rnt"™^^ — cifically INSV RNA mol- 

Exampl 10F: C nstruction containing INSV S RAN sec o ndary structur r gi n ,„ dZU-B. 

A DNA analysis programme is used to locate a secondary structure in the vira. .NSV S RNA. The strongest 
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secondary structure detectable starts at nucleotide 1440 and ends at nucleotide 2041 of SEQ ID No.1 , (SEQ 
ID No 8). 

The DNA fragment carrying the secondary structure region is isolated from plNSV-S61 using a PCR ap- 
proach similar to that described earlier. The two primers used contain the sequences 1440-1460 and 2021- 
2041 of SEQ ID No.1. The PCR fragment is excised from an agarose gel and subsequently treated with T4 
polymerase to create blunt ends and is subsequently cloned into the Smal site of the expression vector pZU- 
B, resulting in the recombinant plasmid pINSV-HpSB . The plasmid pINSV-HpSB is digested with Hindlll and 
the fragment containing the chimeric gene is excised from an agarose gel and ligated into Xbal linearized 
pBIN19, resulting in the transformation vector pINSV-HpSBB. 

(It is clear to a person skilled in the art that other fragments can be isolated from the cDNA clones of the 
INSV S RNAcontaining the hairpin region as described above without interference to function. Also, a fragment 
containing the hairpin region may be synthesized using a DNA-synthesizer.) 

Example 11: Transformation of binary vectors to tobacco plant material 

Methods to transfer binary vectors to plant material are well established and known to a person skilled in 
the art. Variations in procedures exist due to for instance differences in used Agrobactehum strains . different 
sources of explant material , differences in regeneration systems depending on as well the cultivar as the plant 
species used . 

The binary plant transformation vectors as described above are used in plant transformation experiments 
according to the following procedures. The constructed binary vector is transferred by tri-parental mating to 
an acceptor Agrobactehum tumefaciens strain, followed by southern analysis of the ex-conjugants for verif i- 
cation of proper transfer of the construct to the acceptor strain, inoculation and cocultivation of axenic explant 

i material with the Agrobactehum tumefaciens strain of choice, selective killing of the Agrobactehum tumefa- 
ciens strain used with appropriate antibiotics, selection of transformed cells by growing on selective media con- 
taining kanamycine, transfer of tissue to shoot-inducing media, transfer of selected shoots to root inducing me- 
dia, transfer of plantlets to soil, assaying for intactness of the construct by southern analyses of isolated total 
DNAfrom the transgenic plant, assaying for proper function of the inserted chimeric gene by northern analysis 

? and/or enzyme assays and western blot analysis of proteins. 

Example 12: Expression of INSV S RNA sequences in tobacco plant cells 

RNA is extracted from leaves of regenerated plants using the following protocol. Grind 200 mg leaf material 
5 to a fine powder in liquid nitrogen. Add 800 \x\ RNA extraction buffer (1 00 mM Tris-HCl (pH 8.0), 500 mM NaCI. 
2 mM EDTA. 200 mM p-Mercapto-ethanol, 0,4% SDS) and extract the homogenate with phenol, collect the 
nucleic acids by alcohol precipitation. Resuspend the nucleic acids in 0,5 ml 10 mM Tris-HCl (pH 8,0), 1 mM 
EDTA add UCi to a final concentration of 2 M, leave on ice for maximal 4 hours and collect the RNA by cen- 
trifugation. Resuspend in 400 ul 1 0 mM Tris-HCl (pH 8,0), 1 mM EDTA and precipitate with alcohol, finally re- 
40 suspend in 50 uJ1 0 mM Tris-HCl (pH 8,0), 1 mM EDTA. RNAs are separated on glyoxal/agarose gels and blotted 
to Genescreen as described by van Grinsven etal. [(1986) Theor Appl Gen 73:94-101], INSV S RNAsequences 
are detected using DNA or RNA probes labeled with P 2 P], P 5 S] or by using non-radioactive labeling techn.ques. 
Based on northern analysis, it is determined to what extent the regenerated plants express chimaenc INSV 
S RNA sequences. 

45 Plants transformed with chimaeric constructs containing an INSV N protein-encoding sequence are also 

subjected to western blot analysis. Proteins are extracted from leaves of transformed plants by grinding in sam- 
ple buffer according to the method of Laemmii [(1970) Nature 244: 29-30] . A 50 u.g portion of protein is sub- 
jected to electrophoresis in a 12,5 % SDS-polyacryl amide gel essentially as described by Laemmii (1970) su- 
pra . Separated proteins are transferred to nitrocellulose electrophoretically as described by Towbin et al. 

so [(1 979 ) Proc. Natl . Acad. Sci. USA 76:4350-4354]. Transferred proteins are reacted with antiserum raised 
against purified INSV structural or non-structural proteins (Towbin et al.(1979) supra. Based on theresuftsof 
the western analysis, it is determined that transformed plants do contain INSV N proteins encoded by the in- 
serted chimaeric sequences . 

55 Exampl 13: Resistance f plants against INSV infection 

Transformed plants are grown in the greenhouse under standard quarantine conditions in order to prevent 
any infections by pathogens. The transformants are self-pollinated and the seeds harvested. Progeny plants 
are analyzed for segregation of the inserted gene and subsequently infected with INSV by mechanical mocu- 
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on the first two fully extended M ™ ■„„? ,f '"J**" ,he P"» se "ce of carborundum powder 

for symptom development ^SZSTCJZ'" " Se6,1 " n9S ' """"^ '" >n,S - """"^ 

p^Cd^^^^^ 

symptoms within 7 d,ys after inoculaton U " M "^»™0 control plants show severe systemic INSV 

E>a """ a 14: Us * s yn'««ic oligonucleotides for diagnostic purposes 

P"—* 9*d 1 gram oflea, materia,. 

hour, after which the ribonucfeic ac ids JelteZT > 1 ^ preparation is inc ^ated at 4»C for 1 
ed in 25 m 1 0 mM Tns-HCI T mM EDtI pt n T^^T"' ^ n ' b ° nUC,eiC add Pe " et iS resuspend - 
precip^o. The rioonucleic acid p^^££ ^^^^^^ 

sequence isolated from virions ^^ H 2^^^ l ^*' ZaBon - " Sin9 3ny part or parts 
quence information as a ^ (S^^S£^S V T ° n thS baSiS ° f disdosed se " 

detect INSV Related rnja s " " ' v ™ ™ scn P« of reg.ons of the INSV genome are used to 

- - « - ™Nn,er^ * aia9nosea *** «™« * 
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SEQUENCE LISTING 



{ I ) GENERAL INFORMAT ION : 



(i) APPLICANT: 
10 (A) NAME; SANDOZ LTD 

(3) STREET: LICHTSTRASSE 35 

(C) "CITY: BASLE 

(D) STATE: BS 

( E ) COUNTRY : S WI T ZERLAND 

(F) POSTAL CODE (ZIP) : CH-40 02 

(G) TELEPHONE: 0 61-3241111 

(H) TELEFAX: 061-3227532 

(I) TELEX: 965 050 55 



15 



25 



(A) NAME: SANDOZ— PATENT— GMBH 

(3) STREET: HUMBOLDSTRASSE 3 

(C) CITY: LORRACH 

20 (£) COUNTRY: GERMANY 

(F) POSTAL CODE (ZIP) : D-7850 

(G) TELEPHONE: 076 21/4 80 14 

(A) NAME: SANDOZ— ERF INDUNGEN VERWALTUNGSGESELLSCHAFT 
M.B.H. 

(B) STREET: B RUNNER STRASSE 59 

(C) CITY: VIENNA 

(S) COUNTRY: AUSTRIA 

(F) POSTAL CODE (ZIP): A-1230 

(G) TELEPHONE: 8 6 93 61 

(ii) TITLE OF INVENTION: IMPROVEMENTS IN OR RELATING TO ORGANIC 
30 COMPOUNDS 

(iii) NUMBER OF SEQUENCES: 27 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

35 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 IEFQ) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 93810190.4 

40 (2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3017 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 



45 



SO 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: ~ 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



AGAGCAATNN 
AATTACATTA 
CGAAGTCCTC 
TTCATGATCA 
GCAAGACCAG 
TTATAGTGAG 
GGAATGTTGT 
AAGGTCATCT 
TGCCTGAAAT 
TCATTGAAGC 
AG TAT T C AAA 
ATGAATGGCT 
TAAACACTTT 
CTCATTCGTT 
TCCCTAAAAT 
CTCCTAAAGA 
TCATTGAATA 
TTCATAAAAT 
TG7CGTATCC 
CTGTCTGTGA 
AGCCAAAAAA 
GAAAATATTT 
ATTTGGATAG 
TTTCTCCTAG 
GGTTAAACTT 
TAATAACAGA 
AAAATACAAA 
TTTGGTTGCT 
AATCTACTCA 
GTTTTTGTTT 
T TAT AT AT AT 
CTTTTTTAC? 



NNNNKNNNNN 
CTACTTCCAT 
AATCTGGGGA 
ATCTTCCGGA 
TTTCTGTTAC 
ATGTTTTGTG 
CGAAATTCTG 
CAAAGTGTTA 
TACTTCAAGA 
T GC AAA T AAA 
CAACCAATCC 
GTACACACTT 
GGCTGTAAAA 
7GTCAGGCTC 
AATGAAATCA 
G TAT TAT AT A 
- AAC T T TG AT 
TGAGATGAAT 
TCAAAGAATT 
CAGTGTTCAG 
GTTTGAGCTA 
CCTTTCTCAG 
CATCCAGATG 
AT CAGAGG AT 
GTCCCTAAGT 
TAAACCAAAA 
AAAAATCATA 
TTAATAATGC 
CAAATTGGCC 
TGTTTTATTT 
ATATATATAT 



TGTTTATTTC 



NNNNGAACAA 
TTTAACATGT 
ACAACATCTT 
AAGAAGTTGG 
ACTGGTAAAG 
CCTATTTTTG 
GTCAGATCTA 
TCCTCTCAGT 
TTCGGTCTGA 
GGATCATTGT 
ATGGGCAAAG 
AAGCCTGTTT 
TCACTGGCAA 
AATAATAACA 
AACACATACA 
AGCATTCAAT 
CAGTCAAACC 
TTTTCTGATC 
GTTCATTCAC 
CAAGACATGA 
GGGAAAAAGA 
ACCTTGAAAA 
CCCGATTGGA 
GTTTTGAAAG 
AAAGTTTGTT 
ACAATCATAT 
AAATAAATAA 
TTTAAAATGA 
AAAAATTTGT 
GTTTTTTATT 
ATTTTGTAGT 
ACACGCAAAC 



CCAAGCTACA 
CTAGTGCAAT 
CGGGTAAAGC 
TCGAAGCTCA 
TTGGCTTTCT 
ATGACATTGA 
ACACAACAAA 
TGCTCAGAAT 
AAGAATCTGA 
CTTG7GTCAA 
TCAGTGTTCT 
TTAACCAATC 
TGTCTGCAAC 
AGCCTTTTAA 
GCCGGTTCTT 
GTCTTCCGAA 
TCTTCTTGAA 
TAAAAGAACC 
TTCTTGAAAT 
TTGTCTTCAC 
CTTTAAATTA 
GTCTTCCGAG 
AAT TTG ACT A 
CTATTTCTAA 
TACATGCATT 
GAAATAAATA 
AAACCAAAAA 
ATGTATTAGT 
ATTTGTTTTT 
TTGTTTTTTG 
GGTTTTTATT 
ACACTTTCAA 



ACAAATCTTA 
GTATGAAACA 
AGTAGTAGAT 
ACTCTATTCT 
CCCAACAGAA 
TCTGAATTTC 
CACAAACGGT 
GC TTG AAGAG 
CATCTTCCCT 
AGAAGTCCTT 
TTCTCCTACC 
CCAGACCAAC 
TTCTGATTTA 
AATCAGCCTT 
CACCCTGTCT 
TC AC AAC AAT 
TCAACTCCTT 
TTACAATGTT 
CCACACAGAA 
TATAAATGAG 
TTCAGAAGAT 
AAACTCACAA 
TGCTGCAGGT 
ATTAGATTTA 
TAGATCAGAT 
AATAAACATA 
AGGATGGCCT 
AAATTATAAA 
GTTTTTGTTT 
TTTTTTGTTT 
GTTTTTATTA 
GTTTATATAT 



CAATATTGTC 
ATTATCAAAT 
AGTTATTGGA 
GACTCCAGGA 
GAAAAAGAAA 
TCCTTTTCAG 
GTTAAACATC 
CAAATAGCAG 
CCAAATAATT 
TTTGATGTCA 
AGAAGTGTTC 
AACAGGACAG 
ATGTC AG AT A 
TGGATGCGCA 
GATGAATCTT 
GTTGAAACAG 
CTAGCAGTGA 
AT C CAT GAT A 
CTTGCTCAAA 
CCAGATCTAA 
GGTTATGGGA 
ACAATGTCTT 
GAAATAAAAA 
AATTAACCTT 
TAAACAAATC 
AAATATATAA 
TCGGGCACAA 
CTTTAAATCC 
TTTGTTTTTT 
TTTATTTTAT 
TTTTTTGTAG 



60 
120 
180 
240 
300 
350 
420 
48C 
540 
600 
650 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



T AAAAC AC AC 1920 
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10 



20 



25 



30 



ATTAAACTTA TTTCAAATAA TTTATAAAAG CACACTTAAT ACACTCAAAC AATAATTAAT 1980 

TATTTTATTT TTTATTTTAT TTTTTATTTT TATTATTTTT ATTTTTATTT ATTTAAATGC 20 4 0 

ATTTAACACA ACACAAAGCA AACCAAGCTC AAATCTCTTT TAAATAGAAT CATTTTTCCC 2100 

AAAATCAA.A GTAGCATTAA ACATGCTGTA AATGC-ATGTA AGCCCTTCTT TGTAGTGGTC 2150 

CATTGCAGCA AGTCCT TT AG CTTTCGGACT ACAAGCCTTT AGTATATCTG CATATTGTTT 2220 

AGCCTTGCCA ATTTCAACAG AGTTCATGCT ATATCCTTTG CTTTTTAGAA CTGTGCACAC 2280 

15 TTTCCCAACr GCCTCTT7AG TGCTAAACTT AGACATGTCA ATTCCAAGCT CAACATGTTT 23 40 

AGCATCTTGA TAAAT AGCCG GAACTAGTGC AGCTATTTCA AAATTCAGTA CAGATGCTAT 2400 

CAGAGGAAGA CTTCCTCCTA AGAGAACACC CAAGACACAG GATTTCAAAT CTGTGGTTGC 2 4 60 

AAGACCATAT GAGGCAATCA GAGGGTGACT TGGAAGGCTA TTTATAGCTT CAGTCAGAGC 2520 

' AGATCCATTG TCCTTTATCA TTCCAACAAG ATGAACTCTC ACCATTGCAT CAAGTCTTCG 2580 

GAAAG7CATA TCATTGACCC CAACTCTTTC TGAATTGTTT CTAGTTTTCT TAATTGTGAC 2640 

TGATCCAAAA GTGAAGTCAG CACTCTTAAT GACTCTCATT ATAGATTGCC TAT TCTTG AG 27 00 

GAAGGATAGG CAGGATGCAG TAGTCATGTT CTGAATCTTT TCACGGTTGT TGGTAAAGAA 27 60 

GTCAGTGAAA TTGAAAGACC CTTCATTTTG AGTTTCCTCA AATTCTAAGG AATCAGATTG 2820 

AG T CAAAAGC TTGAC7ATGT TCTCCTTGGT AATCTTTGCT TTGTTCATCT TGATCTGCTG 2880 

ACTTTAC7AA CTTTAAAGCT TAAAGTGTTC AAATTACTAA ATAGTACTTG CGGTTAAAGT 2 940 

AGTATTTGGT AAAATTTGTA ATTTTTCAGT TTCTAGCTTT GG AT T AT AT G ATGTTATATT 300 0 

CGTGACACAA TTGCTCT 3017 
(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

40 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMP AT IENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
GAACAACCAA GCTACAACAA ATCTTACAAT ATTGTCAATT ACATTACTAC TTCCATTTTA 60 

' AC ATGTCT AG~ T GCAATG TAT - GAAAC AATT A TCAAATCGAA GTCCTCAATC TGGGGAAGAA 120- 

CATCTTCGGG TAAAGCAGTA GTAGATAGTT ATTGGATTCA TGATCAATCT TCCGGAAAGA 180 

55 
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AGTTGGTCGA AC-CTCAACTC TATTCTGACT CCAGGAGCAA GACCAGTTTC TGTTACAC T G 
GTAAAGTTGG CTTTCTCCCA ACAGAAGAAA AAGAAATTAT AGTGAGATGT TTTG-GCC- 
TTTTTGATGA CATTGATCTG AATTTCTCCT T^TCAGGGAA TGTTGTCGAA ATTCTGGTC B 
GA7CTAACAC AACAAACACA AACGGTGTTA AACA7CAAGG TCATCTCAAA GTGTWCC* 
CTCAGTXGCT CAGAATGCT? GAAGAGCAAA TAGCAGTGCC TGAAA.TACT TCAAGATTCG 
GTCTGAAAGA ATCTG ACAT C TTCCCTCCAA ATAATTTCAT TGAAGCTGCA AATAAAGGAT 
CATTGTCTTG TGTCAAAGAA GTCCTTTTTG AT S7CAAGTA TTCAAACAAC CAATCCATGG 
GCAAAGTCAG TGTTCTTTCT CCTACCAGAA GTGTTCATGA ATGGCTGTAC ACACTTAAGC 
CTGTTTTTAA CCAATCCCAG ACCAACAACA GGACAG7AAA CACTt?TGGCT GTAAAATCAC 
TGGCAATGTC TGCAACTTCT GATTTAATGT CAGATACTCA TTCC-TTTGTC AGGCTCAATA 
ATAACAAGCC TTTTAAAATC AGCCTTTGGA TGCGCATCCC TAAAATAATG AAATCAAACA 
CATACAGCCG GTTCTTCACC CTGTCTGATG AATCTTCTCC TAAAGAGTAT TATATAAGCA 
TTCAATGTCT TCCGAATCAC AACAATGTTG AAACAGTCAT TGAATATAAC TTTGATCAGT 
CAAACCTCTT CTTGAATCAA CTCCTTCTAG CAGTGATTCA TAAAATTGAG ATGAATTTTT 
CTGATCTAAA AGAACCTTAC AATG7TATCC ATGATATGTC GTATCCTCAA AGAATTGTTC 
ATTCACTTCT TGAAATCCAC ACAGAACTTG CTCAAACTGT CTGTGACAGT GTTCAGCAAG 
ACATGATTGT CTTCACTATA AATGAGCCAG ATCTAAAGCC AAAAAAC-TTT GAGCTAGGGA 
AAAAGACTTT AAATTATTCA GAAGATGGTT ATGGGAGAAA A?ATTTCCTT TCTCAGACCT 
TGAAAAGTCT TCCGAGAAAC TCACAAACAA TGTCTTATTT GGATAGCATC CAGATGCCCG 
35 ATTGGAAATT TGACTATGCT GCAGGTGAAA TAAAAATTTC TCCTAGATCA GAGGATGTTT 

TGAAAGCTAT TTCTAAATTA GATTTAAATT AACCTTGGTT AAACTTGTCC CTAAGTAAAG 
TTTGTTTACA TGCATTTAGA TC AGAT T AAA CAAATCTAAT AACAGATAAA CCAAAAACAA 
TCATATGAAA TAAATAAATA AACATAAAAT ATATAAAAAA TACAAAAAAA ATCATAAAAT 
AAATAAAAAC CAAAAAAGGA TGGCCTTCGG GCACAATTTG GTTGCTTTAA TAATGCTTTA 
AAATGAATGT ATTAGTAAAT TATAAACTTT AAATCCAATC TACTCACAAA TTGGCCAAAA 
ATTTGTATTT GTTTTTGTTT TTGTTTTTTG TTTTTTGTTT TTGTTTTGTT TTATTTGTTT 
TTTATTTTGT TTTTTGTTTT TTGTTT TTT A TTTTATTTAT ATATATATAT ATATATATTT 
TGTAGTGGTT TTTATTGTTT TTATTATTTT TTGTAGCTTT TTTACTTGTT TATTTCACAC 
GCAAACACAC TTTCAAGTTT ATAT AT TAAA ACACACATTA AACTTATTTC AAATAATTTA 



TACAAAAAAA ATCATAAAAT 15 60 

1620 
1680 
1740 
1800 
1860 
1920 

TAAAAGCACA CTTAATACAC TCAAACAATA ATTAAT T ATT TTATTTTTTA TTTTATTTTT 1980 
TATTTTTATT ATTTTTATTT TTATTTATTT AAATGCATTT AACACAACAC AAAGCAAACC 2040 
AAGCTCAAAT CTCTTTTAAA TAGAATCATT TTTCCCAAAA TCAATAGTAG CATTAAACAT 



2100 
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10 



15 



GCTGTAAATG GATGTAAGCC CTTCTTTGTA GTGGTCCATT GCAGCAAGTC CTTTAGCTTT 2160 

CGGACTACAA GCCTTTAGTA TATCTGCATA TTGTTTAGCC TTGCCAATTT CAACAGAGTT 2220 

CATGCTATA7 CCTTTGCTTT TTAGAACTGT GCACACTTTC CCAACTGCCT CTTTAGTGCT . 22 30 

AAACTTAGAC ATGTCAAZTC CAAGCTCAAC ATGTTTAGCA T C T T GAT AAA TAGCCGGAAC 2 3 40 

TAGTGCAGCT ATTCCAAAAT T C AG T AC AG A TGCTATCAGA GGAAGACTTC CTCCTAAGAG 2 400 

AACACCCAAG ACACAGGATT TCAAATCTGT GGT T GCAAGA CCATATGAGG CAATCAGAGG 2 4 60 

GTGACTTGGA AGGCTATTTA TAGCTTCAGT CAGAGCAGAT CCATTGTCCT TTATCATTCC 2520 

AACAAGATGA ACTCTCACCA TTGCATCAAG TCTTCGGAAA GTCATATCAT TGACCCCAAC 2580 

TCTTTCTGAA TTGTTTCTAG TTTTCTTAAT TGTGACTGAT CCAAAAGTGA AGTCAGCACT 2640 

CTTAATGACT C TC AT TAT AG ATTGCCTATT CTTGAGGAAG GATAGGCAGG AT GC AG T AG T 2700 

CATGTTCTGA ATCTTTTCAC GGTTGTTGGT AAAGAAGTCA GTGAAATTGA AAGACCCTTC 27 60 

ATTTTGAGTT TCCTCAAATT CTAAGGAATC AGATTGAGTC AAAAGCTTGA CTATGTTCTC 2820 

CTTGGTAATC TTTGCTTTCT TCATCTTGAT CTGCTGACTT TACTAACTTT AAAGCTTAAA 288 0 

GTGTTCAAAT TACTAAATAG TACTTGCGGT TAAAGTAGTA TTTGGTAAAA TTTGTAATTT 2940 

TTCAGTTTCT AGCTTTGGAT TAT AT GAT GT TATATTCGTG AC AC AAT T GC TCT 2 993 
(2) INFORMATION FOR SEQ ID NO: 3: 

30 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : unknown 



20 



25 



35 



40 



50 



(ii) MOLECULE TYPE: cDNA 

<iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: XMPATIENS NECROTIC SPOT VIRUS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
ATGTC-AGTG CAATGTATGA AACAATTATC AAATC GAAGT CCTCAATCTG GGGAACAACA 
45 TCTTCGGGTA AAGCAGTAGT AG AT AGTT AT TGGATTCATG ATCAATCTTC CGGAAAGAAG 

TTGGTCGAAG CTCAACTCTA TTCTGACTCC AGGAGCAAGA CCAGTTTCTG TTACACTGGT 
AAAGTTGGCT TTCTCCCAAC AGAAGAAAAA GAAATTATAG TGAGATGTTT TGTGCCTATT 
TTTGATGACA TTGATCTGAA TTTCTCCTTT TCAGGGAATG TTGTCGAAAT TCTGGTCAGA 
TCTAACACAA CAAACACAAA CGGTGTTAAA CATCAAGGTC ATCTCAAAGT GTTATCCTCT 360 



CAGTTGCTCA GAATGCTTGA AGAGCAAATA GCAGTGCCTG AAATTACTTC AAGATTCGGT 



60 
120 
180 
240 
300 



420 
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15 



20 



25 



30 



40 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATISNS NECROTIC SPOT VIRUS 



CTGAAAGAAT CTGACATCTT CCCTCCAAAT AATTTCATTG AAGC TGC AAA TAAAGGATCA 48C 

TTGTCTTGTG TCAAAGAAGT CCTTTTTGAT GTCAAGTATT CAAACAACCA ATCCATGGGC 5 40 

10 AAAGTCAGTG TTCTTTCTCC TACCAGAAC-T GTTCATGAAT GGCTGTACAC ACTTAAGCCT 600 

GTTTTTAACC AATCCCAGAC CAACAACAGG ACAGTAAACA CTTTGGCTC-T AAAATCACTG 6 60 

GCAATGTCTG CAACTTCTGA TTTAATGTCA GAT ACT C ATT CGTTTGTCAG G C T C AAT AAT 72 0 

AAC AAGC C T T TTAAAATCAG CCTTTGGATG CGCATCCCTA AAATAATGAA ATCAAACACA 7 80 

TACAGCCGGT TCTTCACCCT GTCTGATGAA TCTTCTCCTA AAGAGTATTA TAT AAGC AT T 8 40 

CAATGTCTTC CGAATCACAA CAATGTTGAA ACAGTCATTG AAT AT AACT T TGATCAGTCA SO0 

AACCTCTTCT TG AAT C AACT CCTTCTAGCA GTGATTCATA AAAT T GAG AT GAATTTTTCT 9 60 

GATCTAAAAG AACCTTACAA TGTTATCCAT GATATGTCGT ATCCTCAAAG AATTGTTCAT 1020 

TCACTTCTTG AAAT CC AC AC AGAACTTGCT CAAACTGTCT G7GACAGTGT TCAGCAAGAC 10 80 

ATGATTGTCT T C AC TAT AAA TG AG CC AG A T CTAAAGCCAA AAAAGTTTGA GC T AGGG AAA 1140 

AAGACTTTAA AT TAT T C AG A AGATGGTTAT GGGAGAAAAT ATTTCCTTTC TCAGACCTTG 1200 

AAAAGTCTTC CGAGAAACTC ACAAACAATG TCTTATTTGG ATAGCATCCA GATGCCCGAT 12 60 

TGGAAATTTG ACTATGCTGC AGGTGAAATA AAAATTTCTC CTAGATCAGA GGATGTTTTG 132 C 

AAAGCTATTT C T AAAT T AG A TTTAAATTAA 1 3 5C 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 9 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 

55 (D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: cDNA 
Ciii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION ; SEQ ID NO : 4 : 

45 T T AAAT AG AA TCATTTTTCC CAAAATCAAT AGTAGCATTA AACATGCTGT AAATGGATGT 60 

AAGCCCTTCT TTGTAGTGGT CCATTGCAGC AAGTCCTTTA GCTTTCGGAC TACAAGCCTT 120 

TAGTATATCT GCATATTGTT TAGCCTTGCC AATTTCAACA GAGTTCATGC TATATCCTTT 180 

^ GCTTTTTAGA ACTGTGCACA CTTTCCCAAC TGCCTCTTTA GTGCTAAACT TAGACATGTC 240 

AATTCCAAGC TCAACATGTT TAGCATCTTG AT AAAT AGCC GG AAC TAG T G CAGCTATTTC 300 



AAAATTCAGT AC AGAT GCT A T GAG AGGAAG ACTTCCTCCT AAGAGAACAC CC AAG AC AC A 3 60 



55 



20 



EP 0 566 525 A2 



GGATTTCAAA TCTGTGGTTG C AAG AC C AT A TGAGGCAATC AGAGGGTGAC TTGGAAGGCT 
ATTTAXAGCT T C AG T C AG AG CAGATCCATT GTCCTTTATC ATTCCAACAA GATGAACTCT 
CACCATTGCA TCAAGTCTTC GGAAAGTCAT ATCATTGACC CCAACTCTTT CTGAATTGTT . 
TCTAGTTTTC TTAATTGTGA CTGATCCAAA AGTGAAGTCA GCACTCTTAA TGACTCTCAT 
T AT AG AT TGC CTATTCTTGA GGAAGGATAG GCAGGATGCA GTAGTCATGT TCTGAATCTT 
TTCACGGTTG TTGGTAAAGA AGTCAGTGAA ATTGAAAGAC CCTTCATTTT GAGTTTCCTC 
AAATTCTAAG GAATCAGATT GAGTCAAAAG CTTGACTATG TTCTCCTTGG TAATCTTTGC 
TTCGTTCAT 

(2) INFORMATION FOR SEQ ID KO : 5 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGAGCAATNN NNNNNNNNNN NNNNGAACAA CCCAAGC 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

( D ) TOP O LOGY : unknown 

(ii) MOLECULE TYPE; cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GAT TAT AT G A TGTTATATTC GTGACACAAT TGCTCT 

(2) INFORMATION FOR SEQ ID NO il : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 
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(3) TYPE: nucleic acid 
fC) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

10 

(iii) HYPOTHETICAL: NO 

(iii) ANT 1—3 EN'S E : NO 

(vi) ORIGINAL SOURCE : 
15 (A) DANISM: IMP AT I ENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CCTTGGTTAA ACTTGTCCCT AAGTAAAGTT TGTTTACATG CAT T TAG AT C AG AT T AAAC A 
AATCTAATAA CAGATAAACC AAAAACAATC ATATGAAATA AATAAATAAA CATAAAATAT 
AT AAAAAAT A CAAAAAAAAT CATAAAATAA A T AAAAACC A AAAAAGGATG GCCTTCGGGC 
ACAATTTGGT TGCTTTAATA ATGCTTTAAA ATGAATGTAT T AG T AAAT T A TAAACTTTAA 
ATCCAATCTA CTCACAAATT GGCCAAAAAT TTGTATTTGT TTTTGTTTTT GTTTTTTGT<* 
TTTTGTTTTT GTTTTGTTTT ATTTGTTTTT TATTTTGTTT TTTGTTTTTT GTTTTTTAT- 
TTATTTATAT ATATATATAT AT AT AT T TT G TAGTGGTTTT TATTGTTTTT ATTATTTT^T 
GTAGCTTTTT TACTTGTTTA TTTCACACGC AAAC AC ACT T TCAAGTTTAT AT AT TAAAAC 
ACACATTAAA CTTATTTCAA ATAATTTATA AAAGCACACT TAATACACTC AAACAATAAT 
TAATTATTTT ATTTTTTATT TTATTTTTTA TTTTTATTAT TTTTATTTTT ATTTATTTAA 
ATGCATTTAA CACAACACAA AGCAAACCAA GCTCAAATCT CTT 
35 (2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS* 

(A) LENGTH: 602 base pairs 

(B) TYp E; nucleic acid 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: unknown 



30 



40 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
50 T GG"AAACT TGTCCCTAAG TAAAGTTTGT TTACATGCAT TTAGATCAGA TTAAACAAAT 

CTAATA ACAG AT AAAC C AAA AACAAT CATA TGAAA TAAAT AAATAAACAT AAAATATATA 
AAAAAT AC AA AAAAAATCAT AAAATAAATA AAAACCAAAA AAGGATGGCC TTCGGGCACA 



420 
480 
540 
600 
€43 



€0 
_1ZQ_ 



180 
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ATTTGGTTGC TTTAATAATG CTTTAAAATG AATGTATTAG TAAATTATAA ACTTTAAATC 2 40 

CAATCTACTC ACAAATTGGC CAAAAATTTG TATTTGTTTT TGTTTTTGTT TTTTGTTTTT 30 0 
TGTTTTTGTT TTGTTTTATT TGTTTTTTAT TTTGTTTTTT GTTTTTTGTT TTTTATTTTA . 3 60 

TTTATATATA TAT AT AT AT A TATTTTGTAG TGGTTTTTAT TGTTTTTATT ATTTTTTGTA 420 

GCTTTTTTAC TTGTTTATTT CACACGCAAA CACACTTTCA AG T T T AT AT A TTAAAACACA 48 0 

CATTAAACTT ATTTCAAATA ATTTATAAAA GC AC AC T T AA T AC AC T C AAA CAATAATTAA 54 0 
15 TTATTTTATT TTTTATTTTA TTTTTTATTT TTATTATTTT TATTTTTATT TATTTAAATG 
CA 

(2) INFORMATION FOR SEQ ID NO : 9 : 



20 



25 



30 



35 



40 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI— SENSE : YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMP MIENS NECROTIC SPOT VIRUS 



600 
602 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

AGAGCAATTG TGTCACGAAT AT AAC AT CAT ATAATCCAAA GCTAGAAACT GAAAAATTAC 60 

AAATTTTACC AAAT ACT AC T TTAACCGCAA GTACTATTTA GTAATTTGAA CACT.TTAAGC 120 

TTTAAAGTTA G TAAAGTC AG CAGATCAAGA TGAACAAAGC AAAGATTACC AAGGAGAACA 180 

TAGTCAAGCT TTTGACTCAA TCTGATTCCT TAGAATTTGA GGAAACTCAA AATGAAGGGT 240 

CTTTCAATTT C ACT GACTT C TTTACCAACA ACCGTGAAAA GAT T C AG AAC ATGACTACTG 300 

CATCCTGCCT ATCCTTCCTC AAGAATAGGC AATCTATAAT GAGAGTCATT AAGAGTGCTG 360 

ACTTCACTTT TGGATCAGTC ACAATTAAGA AAACTAGAAA CAATTCAGAA AGAGTTGGGG 420 

TCAATGATAT GACTTTCCGA AGACTTGATG C AATGGT GAG AGTTCATCTT GTTGGAATGA 48 0 

45 TAAAGGACAA TGGATCTGCT CTGACTGAAG CT AT AAAT AG CCTTCCAAGT CACCCTCTGA 540 

TTGCCTCATA TGGTCTTGCA ACCACAGATT TGAAATCCTG TGTCTTGGGT GTTCTCTTAG 600 

GAGGAAGTCT TCCTCTGATA GCATCTGTAC TGAATTTTGA AATAGCTGCA CTAGTTCCGG 660 

CTATTTATCA AG ATGC T AAA CATGTTGAGC TTGGAATTGA CATGTCTAAG TTTAGCACTA 720 

AAGAGGCAGT TGGGAAAGTG TGCACAGTf C" T aAaXAGC AX"~AGG AT AT AGC ~" ATG AACTC7G 780 

TTGAAATTGG CAAGGCTAAA CAATATGCAG AT AT ACT AAA GGCTTGTAGT CCGAAAGCTA 840 
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AAGGACTTGC 7GCAATGGAC CACTACAAAG 
ATGCTACTAT TGATTTTGGG AAAAATGATT 
TTTGTGTTGT G T T AAAT GC A TTTAAA^AAA 

10 

AAA T AAAAAA 7AAAATAATT AATTATTGTT 
T T TGAAAT AA GTTTAATGTG TGTTTTAATA 
ATAAACAAGT AAAAAAGCTA C AAAAAA TAA 

75 TATATATATA 7ATATAAATA AAATAAAAAA 

TAAAACAAAA CAAAAACAAA AAACAAAAAA 
CAATTTGTGA G7AGA7TGGA TTTAAAGTTT 

20 TATTAAAGCA ACCAAATTGT GCCCGAAGGC 

ATTTTTTTTG TATTTTTTAT ATATTTTATG 
GGTTTATCTG T CAT TAG ATT TGTTTAATCT 
AGGGACAAGT TTAACCAAGG TTAATTTAAA 

25 

TCTGATCTAG GAG AAAT T T T TATTTCACCT 
TGGATGCTAT CCAAATAAGA CATTGTTTGT 
GAAAGGAAAT ATTTTCTCCC AT AAC CATC T 

30 TCAAACTTTT TTGGCTTTAG ATCTGGCTCA 

AC AC T GT C AC AG AC AGT TTG AGCAAGTTCT 
CTTTGAGGA? ACGACATATC ATGGATAACA 

35 ATCTCAATTT TATGAATCAC TGCTAGAAGG 

AAGTTATATT CAATGACTGT TTCAACATTG 
TAATACTCTT TAGGAGAAGA TTCATCAGAC 
T TC AT T AT T T TAGGGATGCG CATCCAAAGG 

40 

CTGACAAACG AATGAGTATC TGACATTAAA 
ACAGCCAAAG TGTTTACTGT CCTGTTGTTG 
GTGTACAGCC ATTCATGAAC ACTTCTGGTA 

45 TGGTTGTTTG AATACTTGAC ATCAAAAAGG 

TTTGCAGCTT . CAATGAAATT ATTTGGAGGG 
GAAGTAATTT CAGGCACTGC TATTTGCTCT 

^ AC T T T GAG AT GACCTTGATG TTTAACACCG 

ATTTCGACAA CATTCCCTGA AAAGGAGAAA 
AAACATCTCA CTATAATTTC TTTTTCTTCT 



AAGGGCTTAC ATCCATTTAC AGCATGTTTA 900 

CTATTTAAAA GAGATTTGAG CTTGGTTTGC 9 50 

T AAAAA T AAA AATAATAAAA AT AAAAAAT A 102 0 

TGAGTGTATT AAGTGTGCTT TTATAAATTA 10 30 

TATAAACTTG AAAGTGTG7T TGCGTGTGAA 11 4C 

TAAAAACAAT AAAAACC AC T ACAAAATATA 12 0C 

CAAAAAACAA AAAACAAAAT AAAAAACAAA 12 60 

CAAAAACAAA AACAAATACA AATTTTTGGC 132 0 

ATAATTTACT AAT AC ATTC 7 TTTAAAGCAT 138 0 

CATCCTTTTT TGGTTTTTAT TTATTTTATG 14 40 

TTTATTTATT TATTTCATAT GATTGTTTTT 1500 

GATCTAAATG CATGTAAACA AACTTTACTT 15 60 

TCTAATTTAG AAAT AGCTT T CAAAACATCC 1620 

GCAGCATAGT CAAATTTCCA ATCGGGCATC 1680 

GAGTTTCTCG G AAG AC - T T T CAAGGTCTGA 17 40 

TCTGAATAAT TTAAAGTCTT TTTCCCTAGC 1800 

T T TAT AGT G A AG AC AAT CAT GTCTTGCTGA 18 60 

GTGTGGATTT CAAGAAGTGA ATGAAC AAT T 1920 

TTGTAAGGTT CTTTTAGATC AGAAAAATTC 1980 

AGT TG ATTC A AGAAGAGGTT TGACTGATCA 20 4 0 

TTGTGATTCG GAAGACATTG AATGCTTATA 2100 

AGGGTGAAGA ACCGGCTGTA TGTGTTTGAT 2160 

CTGATTTTAA AAGGCTTGTT ATTATTGAGC 2220 

TCAGAAGTTG CAGACATTGC CAGTGATTTT 2280 

GTCTGGGATT GGTTAAAAAC AGGCTTAAGT 2340 

GGAGAAAGAA CACTGACTTT GC C CAT GGAT 2400 

ACTTCTTTGA CACAAGACAA TGATCCTTTA 24 60 

AAGATGTCAG ATTCTTTCAG ACCGAATCTT 2520 

TCAAGCATTC TGAGCAACTG AGAGGATAAC 2580 

TTTGTGT7TG TTGTGTTAGA TCTGACCAGA 2640 

TTCAGATCAA TGTCATCAAA AATAGGCACA 2700 

GTTGGGAGAA AGCCAACTTT ACCAGTG7AA 27 60 
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CAGAAACTGG TCTTGCTCCT GG AG TC AG AA TAGAGTTGAG CTTCGACCAA CTTCTTTCCG 2 82 0 

GAAGATTGAT CATGAATCCA ATAACTATCT ACTACTGCTT TACCCGAAGA TGTTGTTCCC 288 0 

CAGATTGAGG ACTTCGATTT GATAATTGTT TCATACATTG C AC TAG AC AT GTTAAAATGG . 2940 

AAG7AGTAAT GTAATTGACA AT AT T G T AAG ATTTGTTGTA GCTTGGTTGT TCNKNNNNNN 3CC0 

NNNNNNNNNA TTGCTCT 3017 
(2) INFORMATION FOR SEQ ID NO: 10: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29S3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



20 



25 



<ii) MOLECULE TYPE: cDNA 

Ciii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE: 

<A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

AGAGCAATTG TGTCACGAAT ATAACATCAT AT AAT CC AAA GCTAGAAACT GAAAAATTAC 60 

30 AAATTTTACC AAAT AC T ACT TTAACCGCAA GTACTATTTA GT AAT TT G AA CACTTTAAGC L2C 

TTTAAAGTTA GTAAAGTCAG CAGATCAAGA TGAACAAAGC AAAGATTACC AAGGAGAACA 180 

TAGTCAAGCT TTTGACTCAA TCTGATTCCT TAGAATTTGA GGAAACTCAA AATGAAGGGT 240 

CTTTCAATTT CACTGACTTC TTTACCAACA ACCGTGAAAA GAT TC AG AAC ATGACTACTG 300 

35 

CATCCTGCCT ATCCTTCCTC AAGAATAGGC AAT C TAT AAT GAGAGTCATT AAGAGTGCTG 360 

ACTTCACTTT TGGATCAGTC ACAATTAAGA AAACTAGAAA CAATTCAGAA AGAGTTGGGG 420 

TCAATGATAT GACTTTCCGA AGACTTGATG CAATGGTGAG AGTTCATCTT GT T GG AAT G A 480 

40 TAAAGGACAA TGGATCTGCT C T G ACT G AAG CTATAAATAG CCTTCCAAGT CACCCTCTGA 540 

TTGCCTCATA TGGTCTTGCA ACCACAGATT TGAAATCCTG TGTCTTGGGT GTTCTCTTAG 600 

GAGGAAGTCT TCCTCTGATA GCATCTGTAC TGAATTTTGA AATAGCTGCA CTAGTTCCGG 660 

45 CTATTTATCA AG AT GCT AAA CATGTTGAGC TTGGAATTGA CATGTCTAAG TTTAGCACTA 720 

AAGAGGCAGT TGGGAAAGTG TGCACAGTTC TAAAAAGCAA AGGAT A.T AGC ATGAACTCTG 780 

TTGAAATTGG CAAGGCTAAA CAATATGCAG AT AT ACT AAA GGCTTGTAGT CCGAAAGCTA 840 

AAGGACTTGC TGCAATGGAC CACTACAAAG AAGGGCTTAC ATCCATTTAC AGCATGTTTA 9C0 

50 

" " ATGCT ACT AT TGATTTTGGG AAAAAT G AT T CTATTTAAAA GAGATTTGAG CTTGGTTTGC — 9*0- 

TTTGTGTTGT GTTAAATGCA TTTAAATAAA T AAAAAT AAA AATAATAAAA AT AAAAAAT A 1020 
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AAATAAAAAA 
TTTGAAATAA 
ATAAACAAGT 
TATATATATA 
TAAAACAAAA 
CAATTTGTGA 
TTATTAAAGC 
GATTTTTTTT 
TGGTTTATCT 
TAGGGACAAG 
CTCTGATCTA 
CTGGATGCTA 
AGAAAGGAAA 
CTCAAACTTT 
AACACTGTCA 
TCTTTGAGGA 
CATCTCAATT 
AAAGT T AT AT 
ATAATACTCT 
TTTCATTATT 
CCTGACAAAC 
TACAGCCAAA 
TGTGTACAGC 
TTGGTTGTTT 
ATTTGCAGCT 
TGAAGTAATT 
CACTTTGAGA 
AATTTCGACA 
AAAACATCTC 
ACAGAAACTG 
GG AAGATTGA 



TAAAATAATT 
GTTTAATGTG 
AAAAAAGC T A 
TATATAAATA 
C AAAAAC AAA 
GTAGATTGGA 
AACCAAATTG 
GTATTTTTTA 
GTTATTAGAT 
TTTAACCAAG 
GGAGAAATTT 
-CCAAATAAG 
TATTTTCTCC 
TTTGGCTTTA 
CAGACAGTTT 
TACGACATAT 
TTATGAATCA 
TCAATGACTG 
TTAGGAGAAG 
TTAGGGATGC 
GAATGAGTAT 
GTGTTTACTG 
CATTCATGAA 
GAATACTTGA 
TCAAT GAAAT 
TCAGGCACTG 
TGACCTTGAT 
ACATTCCCTG 
ACTATAATTT 
G7CTTGCTCC 
TCATGAATCC 



AATTATTGTT 
TGTTTTAATA 
CAAAAAATAA 
AAATAAAAAA 
AAACAAAAAA 
TTTAAAGTTT 
TGCCCGAAGG 
TATATTTTAT 
TTGTTTAA7C 
GTTAATTTAA 
TTATTTCACC 
ACATTGTTTG 
CATAACCATC 
GATCTGGCTC 
GAGCAAGTTC 
C AT GG AT AAC 
CTGCTAGAAG 
TTTCAACATT 
ATTCATCAGA 
GCATCCAAAG 
CTGACATTAA 
TCCTGTTGTT 
CACTTCTGGT 
CAT C AAAAAG 
TATTTGGAGG 
CTATTTGCTC 
GTTTAACACC 
AAAAGGAGAA 
CTTTTTCTTC 
TGGAGTCAGA 
AATAACTATC 



CCAGATTGAG GACTTCGATT TGATAATTGT 



TGAGTGTATT 
TATAAACTTG 
T AAAAAC AA T 
CAAAAAACAA 
C AAAAAC AAA 
ATAATTTACT 
CCATCCTTTT 
GTTTATTTAT 
TGATCTAAAT 
ATCTAATTTA 
TGCAGCATAG 
TGAGTTTCTC 
TTCTGAATAA 
ATTTATAGTG 
TGTGTGGATT 
ATTGTAAGGT 
GAGTTGATTC 
GTTGTGATTC 
CAGGGTGAAG 
GCTGATTTTA 
ATCAGAAGTT 
GGTCTGGGAT 
AGGAGAAAGA 
GACTTCTTTG 
GAAGATGTCA 
TTCAAGCATT 
GTTTGTGTTT 
ATTCAGATCA 
TGTTGGGAGA 
AT AGAGT TG A 
^ACTACTGCT 
TTCATACATT 



AAGTGTGCTT 
AAAGTGTGTT 
AAAAAC C ACT 
AAAACAAAAT 
AACAAATACA 
AATACATTCA 
TTGGTTTTTA 
TTATTTCATA 
GCATGTAAAC 
GAAAT AGCTT 
TCAAATTTCC 
GGAAGACTTT 
TTTAAAGTCT 
AAGACAATCA 
TCAAGAAGTG 
TCTTTTAGAT 
AAGAAGAGGT 
GGAAGAC AT T 
AACCGGCTGT 
AAAGGCTTGT 
GCAGACATTG 
TGGTTAAAAA 
ACAC T G ACT T 
ACACAAGACA 
GATTCTTTCA 
CTGAGCAACT 
GTTGTGTTAG 
ATGTCATCAA 
AAGCCAACTT 
GCTTCGACCA 
TT ACCC GAAG 
GC AC TAG AC A 



TTATAAATTA 
TGCGTGTGAA 
AC AAAAT A T A 
AAAAAACAAA 
AATTTTTGGC 
TTTTAAAGCA 
TTTATTTTAT 
TGATTGTTTT 
AAACTTTACT 
TCAAAACATC 
AATCGGGCAT 
TCAAGGTCTG 
TTTTCCCTAG 
TG7CTTGCTG 
AATGAACAAT 
CAGAAAAATT 
TTGACTGATC 
GAATGCTTAT 
ATGTGTT TG A 
TATTATTGAG 
CCAGTGATTT 
CAGGCTTAAG 
TGCCCATGGA 
ATGATCCTTT 
GACCGAATCT 
GAGAGGATAA 
ATCTGACCAG 
AAATAGGCAC 
TACCAGTGTA 
ACTTCTTTCC 

A TGT TG TTCC 

TGTT AAAAT G 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2C40 
2100 
2160 
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2340 
2400 
2460 
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2640 
2700 
2760 
2820 
_2.8_a.Q_ 
2940 
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GAAGTAGTAA TGTAATTGAC AATATTGTAA GATTTGTTGT AGCTTGGTTG TTC 2 99 3 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 78 9 base pairs 

(5) TYPE: nucleic acid 
<C) STRANDEDNES3 : sinale 
(D) TOPOLOGY : unknown" 
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(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: YES 

(vi) ORIGINAL . SOURCE : 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

ATGAACAAAG CAAAGATTAC CAAGGAGAAC ATAGTCAAGC TTTTGACTCA ATCTGATTCC 60 

TTAGAATTTG AGGAAACTCA AAATGAAGGG TCTTTCAATT T C ACT G ACTT CTTTACCAAC 120 

AACCGTSAAA AGATTCAGAA CAT G ACT ACT GCATCCTGCC TATCCTTCCT CAAGAATAGG 180 

CAATCTATAA TGAGAGTCAT TAAGAGTGCT GACTTCACTT TTGGATCAGT CACAATTAAG 2 40 

AAAACTAGAA ACAATTCAGA AAGAGTTGGG GTCAATGATA TGACTTTCCG AAGACTTGAT 300 

30 GCAATGG7GA GAGTTCATCT TGTTGGAATG ATAAAGGACA ATGGATCTGC TCTGACTGAA 360 

GCTATAAACTA GCCTTCCAAG TCACCCTCTG ATTGCCTCAT ATGGTCTTGC AACCACAGAT 420 

TTGAAATCCT GTGTCTTGGG TGTTCTCTTA GGAGGAAGTC TTCCTCTGAT AGCATCTGTA 480 

CTGAATTTTG AAATAGCTGC ACTAGTTCCG GCTATTTATC AAGATGCTAA ACATGTTGAG 540 

CTTGGAATTG ACATGTCTAA GTTTAGCACT AAAGAGGCAG TTGGGAAAGT GTGCACAGTT 600 

CTAAAAAGCA AAGGATATAG CATGAACTCT GTTGAAATTG GCAAGGCTAA ACAATATGCA 660 

GAT AT AC T AA AGGCTTGTAG TCCGAAAGCT AAAGGACTTG CTGCAATGGA C C AC T AC AAA 720 

GAAGGGCTTA CATCCATTTA CAGCATGTTT AATGCTACTA TTGATTTTGG GAAAAATGAT 780 

TCTATTTAA 789 
(2) INFORMATION FOR SEQ ID NO: 12: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND ED NESS : single 

(D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE : CDNA 

-(i-i'i-)— HYPOTHETICAL" NO 

(iii) ANTI-SENSE: YES 



55 



27 



EP 0 566 525 A2 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 





TTAATTTAAA 


TCTAATTTAG 


AAATAGCTTT 


CAAAACATCC 


TCTGATCTAG 


GAGAAATTTT 


£0 




TATTTCACCT 


GCAGCATAGT 


CAAATTTCCA 


ATCGGGCATC 


TGGATGCTAT 


CCAAATAAGA 


120 


15 


CAT TGT TTGT 


GAGTTTCTCG 


GAAGACTTTT 


CAAGGTCTGA 


GAAAGGAAAT 


ATTTTCTCCC 


180 


ATAACCATCT 


TCTGAATAAT 


TTAAAGTCTT 


TTTCCCTAGC 


TCAAACTTTT 


TTGGCTTTAG 


2 40 




ATCTGGCTCA 


TTTATAGTGA 


AGACAATCAT 


GTCTT3CTGA 


ACACTGTCAC 


AG AC AG T T T G 






AGCAAGTTCT 


GTGTGGATTT 


CAAGAAGTGA 


ATGAACAATT 


CTTTGAGGAT 


ACG AC AT AT C 




20 


AT GG AT AACA 


TTGTAAGGTT 


CTTTTAGATC 


AG AAAAAT T C 


ATCTCAATTT 


TATGAATCAC 






TGCTAGAAGG 


AGTTGATTCA 


AGAAGAGGTT 


TGACTGATCA 


AAGTTATATT 


CAATGACTCJT 


/on 
4CU 




TTCAACATTG 


TTGTGATTCG 


GAAGACATTG 


AAT GOT TATA 


TAATACTCTT 




D*zU 


25 


TTCATCAGAC 


AGGGTGAAGA 


ACCGGCTGTA 


TGTGTTTGAT 


TTCATTATTT 


TAGGGATGCf^ 


DUU 




CATCCAAAGG 


C7GATTTTAA 


AAGGC TTGTT 


ATTATTGAGC 


CTGACAAACG 




a £ a 




TG AC ATT AAA 


TCAGAAGTTG 


C AG AC AT TGC 


CAGTGATTTT 


ACAGCCAAAG 






30 


CCTGTTGTTG 


GTCTGGGATT 


GGTTAAAAAC 


AGGCTTAAGT 


GTGTACAGCC 


ATTCATGAAC 


780 


ACTTCTGGTA 


GGAGAAAGAA 


C AC T G ACT T T 


GCCCATGGAT 


TGGTTGTTTG 


AAT AC T TG AC 


840 




ATCAAAAAGG 


ACTTCTTTGA 


CACAAGACAA 


TGATCCTTTA 


TTTGCAGCTT 


CAATGAAATT 


900 




ATTTGGAGGG 


AAGATGTCAG 


ATTCTTTCAG 


ACCGAATCTT 


GAAGTAATTT 


CAGGCACTGC 


960 


35 


T AT TTGCTCT 


TCAAGCATTC 


TGAGCAACTG 


AGAGGATAAC 


ACTTTGAGAT 


GACCTTGATG 


1020 




TTTAACACCG 


TTTGTGTTTG 


TTGTGTTAGA 


TCTGACCAGA ATTTCGACAA 


CATTCCCTGA 


1080 




AAAGGAGAAA 


TTCAGATCAA 


TGTCATCAAA 


AATAGGCACA 


AAACATCTCA 


CTATAATTTC 


1140 


40 


TTTTTCTTCT 


GTTGGGAGAA 


AGCCAACTTT 


ACCAGTGTAA 


CAGAAACTGG 


TCTTGCTCCT 


1200 


GGAGTCAGAA 


TAG AG T T GAG 


CTTCGACCAA 


CTTCTTTCCG 


GAAGATTGAT 


CATGAATCCA 


1260 




ATAACTATCT 


ACTACTGCTT 


TACCCGAAGA 


TGTTGTTCCC 


CAGATTGAGG 


ACTTCGATTT 


1320 




GATAATTGTT 


T C AT AC AT T G 


CACTAGACAT 








1350 


45 


(2) INFORMATION FOR SEQ ID NO: 13: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 
<B) TYPE: nucleic acid 
(C) STRAND EDNESS : single 
, CD) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iii) ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECP.OTIC SPOT VIRUS 

<xi) SEQUENCE DESCRIPTION; SEQ ID NO: 13: c 

60 



AAGAGATTTG AGCTTGGTTT GCTTTGTGTT GTG7TAAATG CATTTAAATA AATAAAAATA 

AAAATAATAA AAATAAAAAA TAAAATAAAA AATAAAATAA TTAATTATTG TTTGAGTGTA 120 

15 TTAAGTGTGC TTTTATAAAT TATTTGAAAT AAGTTTAATG TGTGTTTTAA TATATAAACT 13 0 

TGAAAGTGTG TTTGCGTGTG AAATAAACAA GTAAAAAAGC TACAAAAAAT AATAAAAACA 240 

ATAAAAACCA CTACAAAATA TAT AT AT ATA TAT AT AT AAA TAAAATAAAA AACAAAAAAC 300 

AAAAAACAAA ATAAAAAACA AATAAAACAA AACAAAAACA AAAAACAAAA AACAAAAACA 360 

AAAAC AAAT A CAAATTTTTG GCC AATTTGT GAGTAGATTG GATTTAAAGT TTATAATTTA 420 

CTAATACATT CTTTTAAAGC ATTATTAAAG CAACCAAATT GTGCCCGAAG GCCATCCTTT 4 80 

TTTGGTTTTT ATTTATTTTA TGATTTTTTT TGTATTTTTT ATATATTTTA TGTTTATTTA 540 

TTTATTTCAT ATGATTGTTT TTGGTTTATC TG TT AT TAG A TTTGTTTAAT CTGATCTAAA 600 

TGCATGTAAA CAAACTTTAC TTAGGGACAA GTTTAACCAA GG 642 
(2) INFORMATION FOR SEQ ID NO: 14: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AG AGC AATC A GTGCATCAAA ATTATATCTA GCCGAATTCA ATCATTATCT TCTCAATATT 60 

45 TTAATTCTTA ATCTACCGTC CAG AG AT G AA TAGTTTTTTC AAATCACTCA GATCATCTAG 120 

CAGCAGGGAG C TAG AT CACC CTAGGGTTAC AACTACCCTC TCTAAACAAG GAGCAGACAT 180 

TGTTGTACAC AATCCTTCTG CTAATCACAA CAACAAGGAA GTTCTCCAAA GAGCCATGGA 240 

TAGCTCTAAA GGGAAGATTT TGATGAACAA TACAGGCACC T CAT CACT AG GCACATATGA 300 

GTCTGACCAG~ATATCTGAAT CAGAGTCTTA-TGATCTTTCT GCT AGAATGA TTGTTGATAG 360- 

AAAT CAT C AT ATCTCCAGCT GGAAAAATGA TCTTTTTGTA GGTAATGGTG AT AAAGCTGC 420 
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AACCAAGATA ATTAAGATAC ATCCAACCTG GGATAGCAGA AAACAATACA TGATGATCTC 
AAGGATAGTT ATCTGGATAT GCCCTACTAT AGCTGATCCT GATGGGAAAT TGGCTGTAGC 
TTTAATTGAT CCTAACAAGA GTGTTAATGC CAGAACTGTT T TG AAAGGGC AAGG AAG CAT 
TAAAGATCCr ATATGTTTTG TTTTTTATCT AAAXTGGTCC ATT C C AAAAG TTAACAACAC 
TTCAGAGAAT TGTGTTCAGC TTCATTTATT ATGTGATCAA G T T T AC AAG A AAGATGZTTC 
TTTTGCTAGT GTCATGTATT C T TGG AC AAA AGAATTCTGT GATTCACCAA GAGCAGATCT 
GG AT AAAAGC TGCATGATAA TACCCA7CAA TAGGGCTATT AGAGCCAAAT CGCAAGCCTT 
CATTGAAGCC TGCAAGTTAA TCATACCTAA AGGCAATTCT GAAAAGCAAA T T AG AAG AC A 
ACTTGCAGAG CTAAGTGCTA AT T T AG AG AA ATCTGTTGAA GAAGAGGAGA ATGTTACTGA 
T AAC AAG AT A GAGATATCAT TTGATAATGA AATCTAAATA TGTTTTCATT TAATAATAAA 
TAATATATAT TGTTCATAAT ATTTTGAATG 7TTAAGTAAA AAATAAAGCA AGATAAAAAA 
C TAT AT A TAT ATATATATA7 AGAAGTAT AA AATATATATG TATTTGTGTT TAAAAACAAA 
TCAAAAACCA AAAAAGAAAA AAGAAAAAAT AAACAAAAAA CAAAAACAAA AACAAAAACA 
AACAAAAAGC AAAAAATAGA AAAAAGTTGA AAAAAACCAA AAAAATTTTT TTTGTAAATA 
AATAAGGCTC CGGCCAGATT TGGTCTAAGA CCTTTTTATT TGTTTTTATA CAT T TT AT T T 
GTTTTTGTTG ATTTTTATTT TTATTATTTT TATATTTTTT ATATAGTTTG CTTATTTAAC 
AC T TAT T TAG ACAAAT T AAA TTTATTTGAT T AC AATCAT T CTGCCTTATT TAATTTAAAA 
CACATTTGGT GTATATTCCA ATGAATTTAA TCATATACC3 CTGAAGTCTA GAGGAGGTCT 
TCTTCTAGTG A7GGTGTCTT TACCAGAAGA CGTGGAAACC AAAGAATAAT CATTAGTGTC 
TTCAATATAT TTTGTCTTGT AAGACTTGTT TC T AAC AT AG CCTCTACACA TTGTGGCAAC 
AATAGAGCAG AGGTAAGCAA GAGCAAATAC AAAGAGTATG AGCAATACTA CTCTGACTGT 
ATCAAAGAAG GATCCAAAGT GGCTTGCTAT AAAGTTAAAA GGGC TTT T AA CATAGTCCCA 
AAAGCTCCAA ACTGATGTGT CAGAATTATA TTGCTGTTCC TCGTGTGCAT GTTGGTCATT 
TTGATCAATT ATGTTTTCTG GTTCCAGCAC AGCAACAGAA TCTACAAGTG CCTCAACTGA 
GTATGATTTG TCTCCTTCTG GTTCTATAAT CATTTTTTGT TTTTCTGGGT TAGAAGTGCA 
GAACATTGTC AAG T TAT AC T TATTAGCACC TTTCTTTACT GCTATCTGGT ATGTTGACAA 
TGAACATTGT TTCATGGTTA ACCTTGCAGA AAAAGTTATG TCTGATATAA AT G AGGCAGC 
ACACCTCAGC CCTTGGCTAC ATAAGAAACA TCCCTTACAG CTTAAAGAGA C AG AAC T CAA 
TATAGGCTTT TTTGGTACAG TTTTAAACAA TTCAGAAGGT AGATCCAAAA CAATTTTAAG 
CTTACCTAGA CTAAAGATCT TTTCCATATA AAAACTATTC TGGTCAGTAA ACTGAACTGG 
AATGTC CGAT ATTTGGTTCA A ACCTGTTTT AAATC TG T AT GTGTCATAAC CACATGATTT_ 
7ATCGTAATT GTTTTTTTAC CAATTGCTGA ACAATCCCAG GACAGATCGT TTGTATCTAA 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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1320 
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1440 
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1920 
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2160 
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TGTTTTCTTA GAGAAAATGG GATCACCTTG GTGTGAAAGT TGAGGATGAC CAAACATTTT 
TGATGGATTA TTTAATCTAG CTATGTTTCC CG CAT AT AC G TGACTATCAG GTCCATGAGC 
TATCAGCTGG CCTATTGTTA AGCCATCATT ATGGAAATCC GCTAATATAT CAGCCTGGAA 
AT AT CC? GAT T C AG AT GGG A CTTCCTCAGA TACAGTGAAA CACTTTGCTC CCACAAATCC 
AG AT AT AC AT AC T TC AG ACT TGATTGTTGA TTTAATAACA GAATAAATCC TGAAAGATTG 
ATCCATATCA TACACATTTC TACAAAACCC ACAAGTGGCT CCTTCATTGA TAGCCAAACA 
CCAAACCTCT TCACAACCCC AGTAAGATGT TGGTGTTATG CAGAAATCTT GATACCCAGT 
TATCGGTTGT TCTTTTCTGC AATCTGAGCA TTTACCTGTG CATGTTGAAA AGAAATCAGT 
GTGGGTGCTT TGTATAGGAG CTGTAGTGTA TTGTTCAGAA ACATCATACT GTATTCTAAC 
TTTTTTAATA TAAACAACAA ACTTCTGAGC AGTGCTAGAA CTTTTGTCAT TAAGAGAGAA 
AACTGTGCCC CC AC C T GAT A ATAAAGATTC TTCTATCATG TATCTATATT TTCCATCTAT 
CACCGAGTCA AATATGAGAG ATTTTCTTGG AAAAATGCTT TCAGGTATGT CTGATTCATT 
AGATTTAAGT GCATCTCCAG AAATGTATCC ATATTTTTCA GTTTTATTGT AGAAATCAAT 
TATACCATTC CTAAGCCTTT TCATGAAGTG TAGATTCACA GCATTCAATC CCAATGTGTC 
ACCAGAATAT TCTAAGAACC CATTATCTAA AGGCTTGCTT TGGAAAATAG AGGC AT ACT C 
ACAACCAAAT CTGCATTTGA CAAAAGTTAC TAAAGCATTT TCAGTTATCC TGCCTTTGCA 
TTCTTGATAA GG T AT ACAAT CCATAGGACC TTCTGTCACA ACATTGGTTA GAAAGTTAGA 
TTCTACAATA GAATTTTCTT TAATAGCACA GAAGCATTGG TCTTTTTCAG GACATTTGTC 
ATATCTGTTT GTAACAAAGC GGTCACAACC AGGGACATAA TAACAGCTAT CCAAACACTG 
AGCAGTTTGA GCCATAGACA TAGGCATCTG TGACAAAATC AGAAATCCTA TCAAAGTTTC 
TGTGACTGCT TTTAGGAAAG AGAGGCCTAT TTTTGTATTA ACTATCAAAT GGAACCATTC 
AATGCTAGCC C AGTTGT ATT TTTTATTCTT CTCTGCTGTT CTAGTTATTA TAGGACATTC 
TTCTGAGTGT TCTTCAGAGG CTTTGTTTTT GTTACAAATG CATAATTTTG AGCATTCATG 
GGTTACCAAA CATAAATTTC CACAGACCTT ACATTTCAAG GGAAAATAAG ACCATAAATA 
ATTTATCAGT AGTAGTATAG GATACGTTAT CAATCCCAGA AGATCATACC CAT AGAAC AG 
TGTTTTAGAT GTTTTGTTTA CCAAGTACCT TATAGGGAAA TAGACAATCA GAGCAATCAT 
GATCAATCTA AAC CAT GAGA AGTTGATGCA AGCAGTTTGT TTGTAAATAT TTTTGGAGTA 
CTTTATAATA CAATCTCTAA CTCTTTTGTC CACTAAAGGA ACTTTAGAAG ACTTGTCACC 
GCACAATAGG TTATGCTTAC CATCCATATT TTCTTCTGTG AAAGT C AAAC TAACTGAGCC 
AG AG AAGCT T ATTATGGAAT GGCTCATGTC ACTTCCTTCT CTTTTGACGA CGTAACCCAT 
GATTTTCTCA GGTGTAGTTA ATGAAACTGT ATAAGAATTA ACTATGTTTG TTTTTGATAT 
TTTACAATCA CCTGAGAATT TCACACTCTG GAGAGAGACT GTGCCATTAG TTGGTCTAGA 



2400 
246C 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3X80 
3240 
3300 
3360 
3420 
3480 
' 3540 
3600 
3660 
3720 
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3840 
3900 
3960 
4020 
4080 
4140 
4200- 
4260 
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ATTGTACATG ATTGGATAAT TGTAATTCTC CAAACTTTCA ATTATATAGA AT TTAGTTCC 
TATAGATAAT TTCCTTT7GT IATCGATTT? TGTTATTGGT ACAACTGGAA CAGTTTCAAA 
GCTTCTTGGC AATTCAGAAG ATCCTTCACA GTTTCCCAAT TT AG TT AT AG TGTCACTGAT 
ACATGAATAT AT AACAC CAT TGCTTTCTAC TTGGTAATAA ACATTGAATG TTGAAACTCC 
TTTAATGC7A CAAGTCAAAC T TG AAGC AT T TAGGCATGGA TTTGGTAAAT CCATAACTGA 
TATAGTTGTT GGTGTAC-AAG ACAATCCACT TGGAGATTGA GGTACCTCAT TATTGGCAAG 
AACAGTTTGA GTATCTCGTG TTGGTCTAAG GGTTTTACCT GTTGCATTCT GGAGCATTTC 
AGCCAAAGTA TCTAGAATTT CATTTTTATG AT C T AC AG AA CGGTCATAAT AAGCTTCATC 
ATAAATTTCT GGATGATCGC CCCTTTCAAC ATGAATCTTT GCA7CTGTCT CCTTTAATGC 
CATAAAGGAT AAGATAACAG AAGTAACAAC TAGTGTACAT ACACTAATTT TAACAAGTAA 
CTCGCACATC TTTAGAATTT TCATTCTAAA AAGTCGAATA ACACTAGTTC TAAAATTGCT 
TTATGAG7TT GATCTGTTGT ATGTAGAGTT TTGTTTGCAC TGATTGCTCT 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 912 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

<iii) ANTI-SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

ATGfl 
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30 
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4320 

4380 

4440 

4500 

455G 

4620 

4630 

4740 

4800 

436C 

4920 

4970 



ATGAATAGTT 


TTTTCAAATC 


ACTCAGATCA 


TCTAGCAGCA 


GGGAGCTAGA 


TCACCCTAGG 


60 


GTTACAACTA 


CCCTCTCTAA 


ACAAGGAGCA 


GACATTGTTG 


TACACAATCC 


TTCTGCTAAT 


120 


CACAACAACA 


AGGAAGTTCT 


CCAAAGAGCC 


ATGGATAGCT 


CTAAAGGGAA 


GATTTTGATG 


180 


AACAATACAG 


GCACCTCATC 


ACTAGGCACA 


TATGAGTCTG 


ACCAGATATC 


TGAATCAGAG 


240 


TCTTATGATC 


TTTCTGCTAG 


AATGATTGTT 


GATACAAATC 


ATCATATCTC 


CAGCTGG AAA 


300 


AATGATCTTT 


TTGTAGGTAA 


TGGTGATAAA GCTGCAACCA 


AGATAATTAA 


GATACATCCA 


360 


ACCTGGGATA 


GC AG AAAACA ATACATGATG 


ATCTCAAGGA 


TAGTTATCTG 


GATATGCCCT 


420 


AC TAT AGCTG 


ATCCTGATGG 


GAAATTGGCT 


GTAGCTTTAA 


TTGATCCTAA 


CAAGAGTGTT 


480 


AATGCCAGAA 


CTGTTTTGAA- 


AGGGC AAGGA 


AGCATTAAAG- 


ATCCTATATG" 


TTTTGTTTTT 


540 


TATCTAAATT 


GGTCCATTCC 


AAAAGTTAAC 


AACACTTCAG 


AGAATTGTGT 


TCAGCTTCAT 


6C0 
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TTATTATGTG ATCAAGTTTA CAAGAAAGAT GTTTCTTTTG CTAGTGTCAT GTATTCTTGG 6 60 

ACAAAAGAAT TCTGTGATTC ACCAAGAGCA GATCTGGATA AAAG C T G C AT GA.TAATACCC 720 

ATCAATAGGG CTATT AGAGC CAAATCGCAA GCCTTCATTG AAGCCTGCAA GTTAATCATA 730 

C C T AAAGG C A ATTCTGAAAA GC AAA T T AG A AGACAACTTG CAGAGCTAAG TGCTAATTTA 840 

GAGAAATCTG TTGAAGAAGA GGAGAATGTT ACTGATAACA AG AT AG AG AT ATCATTTGAT 900 

AATGAAATCT AA 912 



(2) INFORMAT ION FOR SEQ ZD NO : 1 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 3 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



ATATGTTTTC ATTTAATAAT AAATAATATA TATTGTTCAT AATATTTTGA ATGTTTAAGT 60 

AAAAAATAAA GCAAGATAAA AAAC T AT ATA TAT AT AT ATA TATAGAAGTA T AAAAT AT AT 120 

ATGTATTTGT GTTTAAAAAC AAATCAAAAA CCAAAAAAGA AAAAAGAAAA AAT AAAC AAA 180 

AAACAAAAAC AAAAACAAAA ACAAACAAAA AGCAAAAAAT AGAAAAAAGT TGAAAAAAAC 240 

CAAAAAAATT TTTTTTGTAA ATAAATAAGG CTCCGGCCAG ATTTGGTCTA AGACCTTTTT 300 

ATTTGTTTTT ATACATTTTA TTTGTTTTTG TTGATTTTTA TTTTTATTAT TTTTATATTT 3 60 

T T T AT AT AG T TTGCTTATTT AACACTTATT TAGACAAATT AAATTTATTT GAT T AC AAT C 420 

ATTCTGCCTT ATTTAATTTA AAACACATTT GGTGTATATT CCAATGAATT TAA 473 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3414 base pairs 

(B) TYPE; nucleic acid 

(C) ST RAND ED NESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 





TCATATACCG 


CTGAAGTCTA 


GAGGAGGTCT 


TCTTCTAGTG 


ATGGTGTCTT 


TACCAGAAGA 


60 


10 


CGTGGAAACC 


AAAGAATAAT 


CATTAGTGTC 


TTCAATATAT 


TTTGTCTTGT 


AAGACTTGTT 


120 




TCTAACATAG 


CCTCTACACA 


TTGTGGCAAC 


AATAGAGCAG 


AGGTAAGCAA 


GAGCAAATAC 


180 




AAAGAGTATG 


AGCAATACTA 


CTCTGACTGT 


ATCAAAGAAG 


GATCCAAAGT 


GGCTTGCTAT 


240 


15 


AAAGTTAAAA 


GGGCTTTTAA 


CATAGTCCCA 


AAAGCTCCAA 


ACTGATGTGT 


CAGAATTATA 


3C0 




TTGCTGTTCC 


TCGTGTGCAT 


GTTGGTCATT 


TTGATCAATT 


ATGTTTTCTG 


GTTCCAGCAC 


360 




AGCAACAGAA 


TCTACAAGTG 


CCTCAACTGA 


GTATGATTTG 


TCTCCTTCTG 


GTTCTATAAT 


420 


20 


CATTTTTTGT 


TTTTCTGGGT 


TAGAAGTGCA 


GAACATTGTC 


AAGTTATACT 


T AT T AGCACC 


480 


7TTCTTTAC7 


GCTATCTGGT 


ATGTTGACAA 


TGAACATTGT 


TTCATGGTTA 


ACCTTGCAGA 


540 




AAAAGTTATG 


TCTGATATAA 


ATGAGGCAGC 


ACACCTCAGC 


CCTTGGCTAC 


ATAAGAAACA 


60C 




TCCCT T AC AG 


CTTAAAGAGA 


CAGAAC TCAA 


TATAGGCTTT 


TTTGGTACAG 


TTTTAAACAA 


660 


25 


TTCAGAAGGT 


AGATCCAAAA 


CAATTTTAAG 


CTTACCTAGA 


CTAAAGATCT 


TTTCCATATA 


720 




AAAACTATTC 


TGGTCAGTAA 


AC TGAACTGG 


AATGTCCGAT 


ATTTGGTTCA 


AACCTGTTTT 


73C 




AAATCTGTAT 


GTGTCATAAC 


CACATGATTT 


TATCGTAATT 


GTTTTTTTAC 


CAATTGCTGA 


84C 


30 


ACAATCCCAG 


GACAGATCGT 


TTGTATCTAA 


TGTTTTCTTA 


GAGAAAATGG 


GATCACCTTG 


900 




GTGTGAAAGT 


TGAGGATGAC 


CAAACATTTT 


TGATGGATTA 


TTTAATCTAG 


CTATGTTTCC 


960 




CGCATATACG 


T G AC TAT C AG 


GTCCATGAGC 


TATCAGCTGG 


CCTATTGTTA 


AGCGATC AT T 


1020 


35 


ATGGAAATCC 


GCTAATATAT 


CAGCCTGGAA 


ATATCCTGAT 


TCAGATGGGA 


CTTCCTCAGA 


10SO 


TACAGTGAAA 


CACTTTGCTC 


CCACAAATCC 


AGATATACAT 


AC T T C AG AC T 


TGATTGTTGA 


1140 




TTTAATAACA 


GAATAAATCC 


TGAAAGATTG 


ATCCATATCA 


T AC AC ATT T C 


TACAAAACCC 


1200 




ACAAGTGGCT 


CCTTCATTGA 


TAGCCAAACA 


CCAAACCTCT 


TCACAACCCC 


AGTAAGATGT 


1260 


40 


TGGTGTTATG 


CAGAAATCTT 


GATACCCAGT 


TATCGGTTGT 


TCTTTTCTGC 


AATCTGAGCA 


1320 




TTTACCTGTG 


CATGTTGAAA 


AGAAATCAGT 


GTGGGTGCTT 


TGTATAGGAG 


CTGTAGTGTA 


1380 




TTGTTCAGAA 


ACATCATACT 


GTATTCTAAC 


TTTTTTAATA 


TAAACAACAA 


ACTTCTGAGC 


1440 


45 


AGTGCTAGAA 


CTTTTGTCAT 


TAAGAGAGAA 


AACTGTGCCC 


CCACCTGATA 


AT AAAGAT TC 


1500 




TTCTATCATG 


TATCTATATT 


TTCCATCTAT 


CACCGAGTCA 


AATATGAGAG 


ATTTTCTTGG 


1560 




AAAAATGCTT 


TCAGGTATGT 


CTGATTCATT 


AGATTTAAGT 


GCATCTCCAG 


AAATGTATCC 


1620 


50 


ATATTTTTCA 


GTTTTATTGT 


AGAAATCAAT 


TATACCATTC 


CTAAGCCTTT 


TCATGAAGTG 


1680 


TAGATTCACA 
AGGCTTGCTT 


GCATTCAATC 
TGGAAAAT AG 


CCAATGTGTC 


ACCAGAATAT 


TCTAAGAACC 


CAT TAT C T AA 


1740 




AGGCATACTC 


ACAACCAAAT 


CTGCATTTGA 


CAAAAGTTAC 


1300 




TAAAGCATTT 


TCAGTTATCC 


TGCCTTTGCA 


TTCTTGATAA 


GGTATACAAT 


CCATAGGACC 


1950 
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25 



TTCTGTCACA ACATTGGTTA GAAAGTTAGA TTCTACAATA GAATTTTCTT TAATAGCACA 1920 

GAAGCATTGS TCTTTTTCAG GACATTTGTC ATATCTGTTT GTAACAAAGC GGTCACAACC 1980 

AGGGACATAA TAACAGCTAT CCAAACACTG AGCAGTTTGA GC CAT AG AC A. TAGGCATCTG . 20 4 0 

TGACAAAATC AGAAATCCTA TCAAAGTTTC TGTGACTGCT TTTAGC-AAAG AG AGGCCT AT 2 ICO 

TTTTGTATTA ACTATCAAAT GGAACCATTC AATGCTAGCC CAGTTGTATT TTTTATTCTT 2160 

CTCTGCTGTT CTAGTTATTA TAGGACATTC TTCTGAGTGT TCTTCAGAGG CTTTGTTTTT 2220 

15 GTTACAAATG CATAATTTTG AGCATTCATG GGTTACCAAA CATAAATTTC C AC AG ACCT T 2280 

ACATTTCAAG GGAAAATAAG AC C AT AAAT A ATTTATCAGT AG T AG T AT AG GATACGTTAT 2340 

CAATCCCAGA AG ATC AT ACC CAT AG AAC AG TGTTTTAGAT GTTTTGTTTA CCAAGTACCT 2400 

TAT AGGG AAA TAGACAATCA GAGCAATCAT GAT CAATCT A AACCATGAGA AGTTGATGCA 2460 

AGCAGTTTGT TTGTAAATAT TTTTGGAGTA CTTTATAATA CAATCTCTAA CTCTTTTGTC 2520 

CACTAAAGGA ACTTTAGAAG ACTTGTCACC GCACAATAGG TTATGCTTAC CATCCATATT 2530 

TTCTTCTGTG AAAGTCAAAC TAACTGAGCC AGAGAAGCTT ATTATGGAAT GGCTCATGTC 2 640 

ACTTCCTTCT CTTTTGACGA CGTAACCCAT GATTTTCTCA GGTGTAGTTA ATGAAACTGT 2700 

ATAAGAATTA ACTATGTTTG TTTTTGATAT TTTACAATCA CCTGAGAATT TCACACTCTG 27 60 

GAGAGAGACT GTGCCATTAG TTGGTCTAGA ATTGTACATG ATTGGATAAT TGTAATTCTC 2820 
30 CAAACTTTCA ATT AT AT AG A ATTTAGTTCC T AT AG AT AAT TTCCTTTTGT TATCGATTTT 2880 
TGTTATTGGT ACAACTGGAA CAGTTTCAAA GCTTCTTGGC AATTCAGAAG ATCCTTCACA 2 940 

GTTTCCCAAT TTAGTTATAG TGTCACTGAT ACATGAATAT AT AACACCAT TGCTTTCTAC 3000 
TTGGTAATAA AC ATT G AAT G TTGAAACTCC TTTAATGCTA CAAGTCAAAC TTGAAGCATT 30 60 

TAGGCATGGA T TTGGT AAAT CCATAACTGA TATAGTTGTT GGTGTAGAAG AC AATCC AC T 3120 
TGGAGATTGA GGTACCTCAT TATTGGCAAG AACAGTTTGA GTATCTCGTG TTGGTCTAAG 3180 
GGTTTTACCT GTTGCATTCT GGAGCATTTC AGCCAAAGTA TCTAGAATTT CATTTTTATG 3240 
ATCTACAGAA CGGTCATAAT AAGCTTCATC AT AAAT T TCT GGATGATCGC CCCTTTCAAC 3300 
ATGAATCTTT GCATCTGTCT CCTTTAATGC CATAAAGGAT AAG AT AAC AG AAGTAACAAC 3360 
TAGTGTACAT AC ACT AAT TT TAACAAGTAA CTCGCACATC TTTAGAATTT TCAT 3414 
45 (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

50 



35 



40 



(ii) MOLEOTLE TYPE: cDNA 
(iii) HYPOTHETICAL : NO 
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(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: I MP AT I ENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AGAGCAATCA GTGCATCAAA ATTATATCTA GCCGAA 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS * 
(A) LENGTH: 3 6 base oairs 
(3) TYPE: nucleic acid 

(C) STRAKDEDNESS; single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMP AT I ENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 : 
CTGTTGTATG TAGAGTTTTG TTTGCACTGA TTGCTC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMP AT I ENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
AGAGCAATCA GTGCAAACAA AACTCTACAT ACAACAGATC AAACTCATAA AGCAATTTTA 
GAACTAGTGT TATTCGACTT TTTAGAATGA AAATTCTAAA GATGTGCGAG TTACTTGTTA 
AAATTAGTGT ATGTACACTA GTTGTTACTT CTGTTATCTT ATCCTTTATG GCATTAAAGG 
AGACAGATGC AAAG AT TCAT GTTGAAAGGG GCGATCATCC AGAAATTTAT GATGAAGCTT 
ATTATGACCG -TTCTGTAGAT— CATAAAAATG - AAATTCTAGA TACTTTGGCT GAAATGCTCC 
AGAATGCAAC AGGTAAAACC CTTAGACCAA C AC GAG AT AC TCAAACTGIT CTTGCCAATA 
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20 



25 



30 



35 



40 



45 



50 



ATGAGGTACC TCAATCTCCA AGTGGATTGT CTTCTACACC AACAACTATA TCAGTTATGG 
ATTTACCAAA TCCATGCCTA AATGCTTCAA GTTTGACTTG T AG CAT T AAA GGAGTTTCAA 
CATTCAATGT TTATTACCAA G T AG AAAGC A ATGGTGTTAT ATATTCATGT ATCAGTGACA 
CTATAACTAA ATT GGG AAAC TGTGAAGGAT CTTCTGAATT GCCAAGAAGC TTTGAAACTG 
TTCCAGTTGT ACCAATAACA AAAAT CG AT A ACAAAAGGAA ATT ATC TATA GGAACTAAAT 
TCTATATAAT TGAAAGTTTG GAGAATTACA ATTATCCAAT CATGTACAAT TCTAGACCAA 
CTAATGGCAC AGTCTCTCTC CAGAGTGTGA AATTCTCAGG TGATTGTAAA AT ATC AAAAA 
C AAAC AT AG T TAATTCTTAT ACAGTTTCAT TAACTACACC T GAG AAAAT C ATGGGTTACG 
TCGTCAAAAG AG AAGG AAG T GACATGAGCC ATTCCATAAT AAGCTTCTCT GGCTCAGTTA 
GTTTGACTTT C AC AG AAG AA AAT ATGGATG GTAAGCATAA CCTATTGTGC GGTGACAAGT 
CTTCTAAAGT TCCTTTAGTG GACAAAAGAG TTAGAGATTG TATTATAAAG TACTCCAAAA 
AT AT TT AC AA ACAAACTGCT TGCATCAACT TCTCATGGTT TAGATTGATC . ATGATTGCTC 
TGATTGTCTA TTTCCCTATA AGGTACTTGG TAAACAAAAC AT C T AAAAC A CTGTTCTATG 
GGTATGATCT TCTGGGATTG ATAACGTATC CTATACTACT ACTGATAAAT TATTTATGGT 
CTTATTTTCC CTTGAAATGT AAGGTCTGTG GAAATTTATG TTTGGTAACC CATGAATGCT 
CAAAATTATG CATTTGTAAC AAAAACAAAG CCTCTGAAGA ACACTCAGAA GAATGTCCTA 
TAATAACTAG AACAGCAGAG AAGAATAAAA AATACAACTG GGCTAGCATT GAATGGTTCC 
ATTTGATAGT TAATACAAAA ATAGGCCTCT CTTTCCTAAA AGCAGTCACA GAAACTTTGA 
TAGGATTTCT GATTTTGTCA CAGATGCCTA TGTCTATGGC TCAAACTGCT CAGTGTTTGG 
ATAGCTGTTA TTATGTCCCT GGTTGTGACC GCTTTGTTAC AAACAGATAT GACAAATGTC 
CTGAAAAAGA CCAATGCTTC TGTGCTATTA AAG AAAAT TC TATTGTAGAA TCTAACTTTC 
TAACCAATGT TGTGACAGAA GGTCCTATGG ATTGTATACC T T ATC AAG AA TGCAAAGGCA 
GGATAACTGA AAATGCTTTA GTAACTTTTG TCAAATGCAG ATTTGGTTGT GAGTATGCCT 
CTATTTTCCA AAGCAAGCCT TTAGATAATG GGTTCTTAGA ATATTCTGGT GACACATTGG 
GATTGAATGC TGTGAATCTA CACTTCATGA AAAGGCTTAG GAATGGTATA AT T GAT T TCT 
ACAATAAAAC T GAAAAAT AT GGATACATTT CTGGAGATGC ACTTAAATCT AAT G AAT C AG 
ACATACCTGA AAGCATTTTT CCAAG AAAAT CTCTCATATT TGACTCGGTG ATAGATGGAA 
AAT AT AG AT A CATGATAGAA GAATCTTTAT TATCAGGTGG GGGCACAGTT TTCTCTCTTA 
AT G AC AAAAG TTCTAGCACT GCTCAGAAGT TTGTTGTTTA TATTAAAAAA GTT AG AAT AC 
AGTATGATGT TTCTGAACAA TACACTACAG CTCCTATACA AAGCACCCAC ACTGATTTCT 



420 
480 
540 
600 
560 
720 
78C 
840 
930 
960 
1C20 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
17 40 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



TTTCAACATG CACAGGTAAA TGCTCAGATT GCAGAAAAGA AC AACCG AT A ACTGGGTATC 
AAG AT T TC TG CATAACACCA ACATCTTACT GGGGTTGTGA AGAGGTTTGG TGTTTGGC T A 



222C 
2280 
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TCAATGAAGG 


AGCCACTTGT 


GGGTTTTGTA 


GAAATG^GTA 


— OA 1 A 1 OO A 1 


LAA a LTTTCA 


234 0 




GGATTTATTC 


TGTTATTAAA 


TCAACAATCA 


AGTC TGA-AGT 


ATGT1T A rpf— rp 
AXOlAlAlLx 


GGATTTGTGG 


2400 


in 


GAGCAAAGTG 


TTTCACTGTA 


TCTGAGGAAG 


TCCC VTCTGA 

■* wV^*^XVX w X VJXV 


A TP A firs tTST 
A X V.ACUA 1 A 1 




2460 




ATATATTAGC 


GGATTTCCAT 


AATGATGGCT 


T AAP AAT 2 GG 

x xvxv\ HF< rvxv x **vj\j 


CUAvv 1 OA 1 A 


GCTCATGGAC 


2 52 0 




CTGATAGTCA 


CGTATATGCG 


. GGAAACATAG 


C TAG ATT AAA 

w aV\m XV X X XVXVXV 


iAAlWhiLft 


AAAA. GTTTG 


2580 


15 


GTCATCCTCA 


ACTTTCACAC 


CAAGGTGATC 


CCATTTTCTP 


1 aaoaaaaV^ A 


T TAG AT AC AA 


2 640 


ACGATCTGTC 


CTGGGATTGT 


TCAGCAATTG 


G*"AA A AAAAP 
lw 4. xvxvxvxvxvxvxv\^ 


AAl 1 ALbA 1 A 


AAATCATGTG 


2700 




GTTATGACAC 


ATACAGATTT 


AAAAC AGG T T 


TC^ AAPPA A A" 1 


A 1 vOOAV* A 1 i 


CC AGT TCAGT 


2760 




TTACTGACCA 


G AAT AG TTTT 


TA T AT 1 AAA 


AvJA X L — 1 1 AC 


1 v_ 1 a'o<j 1 AAo 


C7TAAAATTG 


2820 


20 


TTTTGGATCT 




i -IvjI 1 ^ aaaa 


Llol Aw AAA 


AAAGCCTATA 


TTGAGTTCTG 


2880 




TCTCTTTAAG 


P T T A A riCZCi A 


^ 1^ T T* *P P m T & T 
- O I llVw^. 1 A* 


o l Aox^L, AAGG 


GCTGAGGTGT 


GCTGCCTCAT 


2S40 






Vw^L — i i 1 1 


mp rn /-• j. 

IL-l ovAAWj 1 


TaACCATGAA 


ACAATGTTCA 


TTGTCAACAT 


3CO0 


25 


APPAfS AT AP-P 


AO J. AAAO AAA 


oo 1 bv 1 AA I A 


AGT A T AAC TT 


GACAATGTTC 


TGCACTTCTA 


3C60 






n^rJuuuUi .l o 


Al xAl AoAAC 


CAGAAGGAGA 


C AAAT CAT AC 


TCAGT TGAGG 


3120 




PAPTTCVTAf^A 


rn <n n m n rn 
1 1 — O 1 1 \3\* i. 


o 1 oi-. I vj^AAv_ 


CAGAAAACAT 


AATTGATCAA 


AAT G AC CAAC 


3180 


30 


A TGT AP AC d A 


CXCZ A AP A(^P A & 


1 Al AA1 l«-.lu 


AC- ACAT C AG T 


TTGGAGCTTT 


TGGGACTATG 


3240 


T l " n AA A A f^PP P 


ilil aaV-. 111 


Al AoCAAoCU 


AC TT T GG AT C 


CTTCTTTGAT 


ACAGTCAGAG 


3300 




TAGTATTGPT 


PATAP'"PT' T, T 

^—A J. Av_ _ V_ I A X 


ul Al a 


TXGCTTACC - 


CTGCTCTATT 


GTTGCCACAA 


3360 




T G^GTAGAGG 


P^ATfSTTAfSA 


nAUnnu 1 LI 1 




ATATATTGAA 


GACACTAATG 


3420 


35 


ATTATTCTTT 


GCTTTPPArG 

WW XXX Sb-V— -AW-0 


1U1 Xl*XOOlA 


AAuA^AU^AX 


CAC T AGAAG A 


AGACCTCCTC 


3480 




T AG ACTT C AG 


CGGTATATf4A 


j, X AfVfiJ. X wA..x 




X f~*f~* TV tv 7\ tr\t~* m 

AULAAATbTb 


TT T T AAAT T A 


3540 




AATAAGGCAG 


AA^GAT tgt a 

XVXV a \JXV X X »J X XV 


A TP A A ATA A A 


111 AAl 1 lU 1 


v- 1 AAATAAGT 


GT T AAAT AAG 


3600 


40 


CAAAC T AT AT 


AAAAAA T A m A 

xvxvxvxvxvxv X XV XV 


AAA AT A ATA A 
AAAA X AA 1 An 


AAAl AAAAAl 


■ P A ?A TV Ti Ti Tt H P 

LAA^AAAAAC 


AAAT AAAATG 


3 660 




TATAAAAACA 


AATAAAAAGG 


TPT*TAf5APP& 


An 1LJ. VjoCUo 


uAuv* C- T TAT T 


TAT T T AC AAA 


3720 




AAAAATTTTT 


TTGGTTTTTT 


TCAACTTTTT 


TCTATTTTTT 


GCTTTTTGTT 


TGTTTTTGTT 


3780 




TTTGTTTTTG 




ATTTTTTCTT 




TGGTTTTTGA 


TTTGTTTTTA 


3840 


45 


AACACAAATA 


CAT AT AT AT T 


TTATACTTCT 


ATATATATAT 


ATATATATAG 


TTTTTTATCT 


3900 




TGCTTTATTT 


T T TACT T AAA 


CATTCAAAAT 


AT T ATG AAC A 


AT AT AT AT T A 


T T TAT T ATT A 


3960 




AATGAAAACA 


TATTTAGATT 


TCATTATCAA 


ATGATATCTC 


TATCTTGTTA 


TCAGT AACAT 


4020 


50 


TCTCCTCTTC 


T T C AAC AG AT 


TTCTCTAAAT 


TAGCACTTAG 


CTCTGCAAGT 


TGTCTTCTAA 


4C80 




TTTGCTTTTC 


AGAATTGCCT 


TTAGGTATGA 


TTAACTTGCA 


GGCTTCAATG 


AAGGCTTGCG 


.4140 




ATTTGGCTCT 


AATAGCCCTA 


TTGATGGGTA 


TTATCATGCA 


GCTTTTATCC 


AGATCTGCTC 


4200 
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TTGGTGAATC ACAGAATTCT TTTGTCCAAG AAT AC AT G AC ACT AGC AAAA GAAACATCTT 42 60 

TCTTGTAAAC TTGACCACAT AATAAATGAA GCTGAACACA ATTCTCTGAA GTGTTGTTAA 4 32 0 
10 CTTTTGGAAT GG AC C AAT TT AG AT AAA-AAA CAAAACATAT AGGATCTTTA ATGCTTCCTT . 439 0 

GCCCTTTCAA AACAGTTCTG GCATTAACAC TCTTGTTAGG ATCAATTAAA GCTACAGCCA 4 440 

ATTTCCCATC AGGATCAGCT AT AG T AG GGC AT AT CC AG AT AACTATCCTT GAG AT CAT CA 4500 

TG^ATTGTTT TCTGCTATCC CAGGTTGGAT GTATCTTAAT TATCTTGGTT GCAGCTTTAT 4560 

15 

CACCATTACC TACAAAAAGA TCATTTTTCC AG CT GG AG AT AT GAT GAT T T GTATCAACAA 4 620 

TCATTCTAGC AGAAAGATCA TAAGACTCTG ATTCAGATAT CTGGTCAGAC TCATATGTGC 4 680 

CTAGTGATGA GGTGCCTGTA TTGTTCATCA AAATCTTCCC TTTAGAGCTA TCCATGGCTC 47 40 

20 T T T G G AG AAC TTCCTTGTTG TTGTGATTAG CAGAAGGATT GTGTACAACA ATGTCTGCTC 48C0 

CTTGTTTAGA GAGGGTAGTT GTAACCCTAG GG T GAT C TAG CTCCCTGCTG CTAGATGATC 48 60 

TGAGTGATTT GAAAAAACTA TTCATCTCTG G AC GG T AGAT TAAGAATTAA AAT AT T GAGA 4 920 

AGATAATGAT TGAATTCGGC TAG AT AT AAT TTTGATGCAC TGATTGCTCT 497C 
(2) INFORMATION FOR SEQ ID NO:21: 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3414 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



35 



40 



(vi) ORIGINAL SOURCE: 

<A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 

ATGAAAATTC TAAAGATGTG CGAGTTACTT GTTAAAATTA GT G TAT GT AC ACTAGTTGTT 60 

ACTTCTGTTA TCTTATCCTT TATGGCATTA AAGGAGACAG ATGCAAAGAT TCATGTTGAA 120 

AGGGGCGATC ATCCAGAAAT TTATGATGAA GCTTATTATG ACCGTTCTGT AGAT CAT AAA 180 

AATGAAATTC T AGAT ACT TT GGCTGAAATG CTCCAGAATG CAACAGGTAA AACCCTTAGA 240 

CCAAC ACG AG ATACTCAAAC TGTTCTTGCC AAT AAT G AGG TACCTCAATC TCCAAGTGGA 300 

TTGTCTTCTA CACCAACAAC TATATCAGTT AT GG AT T T AC CAAATCCATG CCTAAATGCT 3 60 

SO TCAAGTTTGA CTTGTAGCAT TAAAGGAGTT TCAACATTCA ATGTTTATTA CCAAGTAGAA 420 



45 



AGC AAT GG T G T TAT AT ATT C ATGTATCAGT GACACTATAA CTAAATTGGG AAACTGTGAA 480 
GGATCTTCTG AATTGCCAAG AAGCTTTGAA ACTGTTCCAG TTGTACCAAT AACAAAAATC 540 



55 
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15 



GATAACAAAA GG AAA? T AT C TATAGGAACT AAATTCTATA TAATTGAAAG TTTGGAGAAT 600 

TACAATTATC CAATCATGTA CAATTCTAGA CCAACTAATG GCACAGTCTC TCTCCAGAGT 6 60 

w GTGAAATTCT CAGGTC-ATTG TAAAATATCA AAAACAAACA TAGTTAATTC TTATACAGTT 720 
TCATTAACTA CACCTGAGAA AATCATGGGT TACGTCGTCA AAAGAGAAGG AAG T G AC A" G 
AGCCATTCCA TAATAAGCTT CTCTGGCTCA GTTAGTTTGA CTTTCACAGA AGAAAATATG 
GATGGTAAGC ATAACCTATT GTGCGGTGAC AAGTCTTCTA AAGTTCCTTT AGTGGACAAA 
AGAGTTAGAG ATTGTATTAT AAAGTACTCC AAAAAT AT T T ACAAACAAAC TGCTTGCATC 
AACTTCTCAT GGTTTAGATT GATCATGAT? GCTCTGATTG TCTATTTCCC TATAAGGTAC 
TTGGTAAACA AAACATCTAA AACACTGTTC TATGGGTATG ATCTTCTGGG ATT GAT AACG 
20 TATCCTATAC TACTAC^GAT AAATTATTTA TGGTC7TATT TTCCCTTGAA ATGTAAGGTC 
TGTGGAAATT TATGTTTGGT AACCCATGAA TGCTCAAAAT TATGCATTTG TAACAAAAAC 
AAAGCCTCTG AAGAACACTC AG AAG AA T G T CCTATAATAA CTAGAACAGC AG AG AAG AAT 

25 AAAAAAT AC A ACT GGGC TAG CATTGAATGG TTCCATTTGA TAGTTAATAC AAAAATAGGC 1320 

CTCTCTTTCC TAAAAGCAGT CACAGAAACT TTGATAGGAT TTCTGATTTT GTCACAGATG 1380 

CCTATGTCTA TGGCTCAAAC TGCTCAGTGT TTGGATAGCT GTTATTATGT CCCTGGTTGT 14 40 

GACCGCTTTG TTACAAACAG ATATGACAAA TGTCCTGAAA AAGACCAATG CTTCTGTGCT 1500 

30 

AT T AAAG AAA ATTCTATTGT AGAATCTAAC TTTCTAACCA ATGTTGTGAC AGAAGGTCCT 15 60 

ATGGATTGTA TACCTTATCA AGAATGCAAA GGCAGGATAA CTGAAAATGC TTTAGTAACT 1620 

TTTGTCAAAT GC AG AT T T GG TTGTGAGTAT GCCTCTATTT TCCAAAGCAA GCCTTTAGAT 1680 

35 AATGGGTTCT TAG AAT AT TC TGGTGACACA TTGGGATTGA ATGCTGTGAA TCTACACTTC 17 40 

AT GAAAAGGC TTAGGAATGG TATAATTGAT TTCTACAATA AAACTGAAAA ATATGGATAC 18 00 

AT T TC T GG AG ATGCACTTAA ATCTAATGAA T C AG AC AT AC CTGAAAGCAT TTTTCCAAGA 18 60 

AAATCTCTCA TATTTGACTC GGTGATAGAT GGAAAATATA GATACATGAT AG AAG AAT C T 1920 

TTATTATCAG GTGGGGGCAC AGTTTTCTCT CTTAATGACA AAAGTTCTAG CACTGCTCAG 1980 

AAGTTTGTTG TTTATATTAA AAAAGTTAGA ATACAGTATG ATGTTTCTGA AC AAT AC ACT - 2040 

ACAGCTCCTA TACAAAGCAC C C AC AC TG AT TTCTTTTCAA CATGCACAGG TAAATGCTCA 2100 

GATTGCAGAA AAGAACAACC GATAACTGGG TATCAAGATT TCTGCATAAC ACCAACATCT 2160 

TACTGGGGTT GTGAAGAGGT TTGGTGTTTG GCTATCAATG AAGGAGCCAC TTGTGGGTTT 2220 

TGTAGAAATG TGTATGATAT GGATCAATCT TTCAGGATTT ATTCTGTTAT TAAATCAACA 22 80 

50 ATCAAGTCTG AAGTATGTAT AT CT GG ATT T GTGGGAGCAA AGTGTTTCAC TGTATCTGAG 2340 

GAAGTCCCAT CTGAATCAGG ATATTTCCAG GCTGATATAT TAC-CGGATTT CCATAATGAT 24C0 



40 
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780 
840 
9C0 
960 
1C20 
1080 
1140 
1200 
1260 



GGCTTAACAA TAGGCCAGCT GATAGCTCAT GGACCTGATA GTCACGTATA TGCGGGAAAC 2 4 60 
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5 





ATAGCTAGAT 


TAAATAATCC 


ATCAAAAATG 


TTTGGTCATC 


CTCAACTTTC 


ACACCAAGGT 


2520 




GATCCCATTT 


TCTCTAAGAA 


AACATTAGAT 


ACAAACGATC 


TGTCCTGGGA 


TTGTTCAGCA 


2580 


10 


ATTGGTAAAA 


AAACAATTAC 


GATAAAATCA 


TGTGGTTATG 


ACACATACAG 


ATT T AAAAC A . 


2640 




GGTTTGAACC 


AAATATCGGA 


CATTCCAGTT 


CAGTTTACTG 


ACCAGAATAG 


TTTTTATATG 


2700 




GAAAAGATC7 


TTAGTCTAGG 


TAAGCTTAAA 


ATTGTTTTGG 


ATCTACCTTC 


TGAATTGTTT 


2760 




AAAACTGTAC 


CAAAAAAGCC 


TATATTSAGT 


TCTGTCTCTT 


TAAGCTGTAA 


GGGATGTCTC 


2820 


■IK. 
TO 


TTATGTAGCC 


AAGGGCTGAG 


GTGTGCTGCC 


TCATTTATAT 


CAGACATAAC 


TTTTTCTGCA 


2880 




AGGTTAACCA 


TGAAACAATG 


TTCATTGTCA 


AC AT AC C AG A 


T AGC AG T AAA 


GAAAGGTGCT 


2940 




AATAAGTATA 


ACTTGACAAT 


GTTCTGCACT 


TCTAACCCAG 


AAAAACAAAA 


AATGATTATA 


3000 


20 


GAACCAGAAG 


GAGACAAATC 


ATACTCAGTT 


GAGGCACTTG 


TAGATTCTGT 


TGCTGTGCTG 


3060 




GAACCAGAAA 


ACATAATTGA 


T C AAAATG AC 


CAACATGCAC 


ACGAGGAACA 


GCAATATAAT 


3120 




TCTGACACAT 


C AG T T T GG AG 


CTTTTGGGAC 


TATGTTAAAA 


GCCCTTTTAA 


CTTTATAGCA 


318C 


25 


AGCCACTTTG 


GATCCTTCTT 


TGATACAGTC 


AG AG TAG TAT 


TGCTCATACT 


CTTTGTATTT 


3240 


GCTCTTGCTT 


ACCTCTGCTC 


TATTGTTGCC 


ACAATGTGTA 


GAGGCTATGT 


TAGAAACAAG 


33C0 




TCTTACAAGA 


CAAAATATAT 


TGAAG AC AC T 


AATGATTATT 


CTTTGGTTTC 


CACGTCTTCT 


3360 




GGTAAAGACA 


C CAT C AC TAG 


AAGAAGACCT 


CCTCTAGACT 


TCAGCGGTAT 


ATGA 


3414 


30 


(2) INFORMATION FOR SEQ ID NO: 22: 










35 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 










(ii) MOLECULE TYPE : cDNA 












<iii) HYPOTHETICAL ; 


: NO 
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(iii) ANTI-SENSE : YES 










(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 








45 


TTAGATTTCA 


TTATCAAATG 


ATATCTCTAT 


CTTGTTATCA 


GTAACATTCT 


CCTCTTCTTC 


60 




AACAGATTTC 


TCTAAATTAG 


CACTTAGCTC 


TGCAAGTTGT 


CTTCTAATTT 


GCTTTTCAGA 


120 




ATTGCCTTTA 


GGTATGATTA 


ACTTGCAGGC 


TTCAATGAAG 


GCTTGCGATT 


TGGCTCTAAT 


180 


50 


AGCCCTATTG 


ATGGGTATTA 


TCATGCAGCT 


TTTATCCAGA 


TCTGCTCTTG 


GTGAATCACA 


240 




GAATTCTTTT 


GTCCAAGAAT 


ACATGACACT 


AGCAAAAGAA 


ACATCTTTC? 


TGTAAACTTG 


30C 




ATCACATAAT 


AAA TG AAGC T 


GAACACAATT 


CTCTGAAGTG 


TTGTTAACTT 


TTGGAATGGA 


360 
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10 



15 



25 



30 



45 



420 
480 
540 
600 
660 
720 
780 
840 
900 
912 



CCAATTTAGA TAAAAAACAA AACATATAGG ATCTTTAATG CTTCCTTGCC CTTTCAAAAC 
AGTTCTGGCA TTAACACTCT TGTTAGGATC AATTAAAGCT ACAGCCAATT TCCCATCAGG 
ATCAGCTATA GTAGGGCATA TCCAGATAAC TAT CC T T GAG ATCATCATGT ATTGTTTTCT 
GCTATCCCAG GTTGGATGTA TCTTAATTAT CTTGGTTGCA GCTTTATCAC CAT T AC C T AC 
AAAAAGATCA TTTTTCCAGC T GG AG AT AT G ATGATTTGTA TCAACAATCA TTCTAGCAGA 
AAGATCATAA GACTCTGATT CAGATATCTG GTCAGACTCA TATGTGCCTA GTGATGAGGT 
GCCTGTATTG TTCATCAAAA TCTTCCCTTT AGAGCTATCC ATGGCTCTTT GG AG AAC T T C 
CTTGTTGTTG TGATTAGCAG AAGGATTGTG TACAACAATG TCTGCTCCTT GTTTAGAGAG 
GGTAGTTGTA ACCCTAGGGT GATCTAGCTC CCTGCTGCTA GATGATC TGA GTGATTTGAA 
20 AAAACTATTC AT 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

( vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tobacco mosaic virus 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GGATCCGGAA CATGGTGGAG CACGACACGC TTGTCTACTC CAAAAATATC AAAGATACAG 
TCTCAGAAGA CCAAAGGGCA AT TGAG ACT T TTCAACAAAG TTATTGTGAA GATAGTGGAA 
^ AAGGAAGGTG GCTCCTACAA ATGCCATCAT TGCGATAAAG GAAAGGCCAT CGTTGAAGAT 180 

GCCTCTGCCG ACAGTGGTCC CAAAGATGGA CCCCCACCCA CGAGGAGCAT CGTGGAAAAA 2 40 

GAAGACGTTC CAACCACGTC TTCAAAGCAA GTGGATTGAT GTGATATCTC CACTGACGTA 
AGGGATGACG CACAATCCCA CTATCCTTCG CAAGACCCTT CCTCTATATA AGGAAGTTCA 
TTTCATTTGG AG AGG AC T T T TTACAACAAT TACCAACAAC AACAAACAAC AAACAACATT 
ACAATTACTA TTTACAATTA CCCGGG 
(2) INFORMATION FOR SEQ ID NO: 24: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 303 amino acids 
(3) TYPE: amino acid 

(C) " STRAND EDNESS! single _ ~" — 

CD) TOPOLOGY: unknown 

55 



60 

12 0 



300 
360 
420 
446 
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<ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

{iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Asn Ser Fhe Phe Lys Ser Leu Arg Ser Ser Ser Ser Arg Glu Leu 
15 10 15 

Asc His Pro Arg Val Thr Thr Thr Leu Ser Lys Gin Gly Ala Asp lie 
20 25 30 

Val Val His Asn Pro Ser Ala Asn His Asn Asn Lys Glu Val Leu Gin 
35 40 45 

Arg Ala Met Asp Ser Ser Lys Glv Lys lie Leu Met Asn Asn Thr Gly 
50 55 60 

Thr Ser Ser Leu Gly Thr Tyr Glu Ser Asp Gin lie Ser Glu Ser Glu 
65 70 75 80 

Ser Tyr Asp Leu Ser Ala Arg Met He Val Asp Thr Asn His His He 
85 90 95 

Ser Ser Trp Lys Asn Asp Leu Phe Val Glv Asn Gly Asp Lys Ala Ala 
100 ICS 110 

Thr Lys lie He Lys lie His Pro Thr Trp Asp Ser Arg Lys Gin Tyr 
115 120 125 

Met Met He Ser Arg He Val He Trp He Cys Pro Thr lie Ala Asp 
130 135 140 

Pro Asp Gly Lys Leu Ala Val Ala Leu lie Asp Pro Asn Lys Ser Val 
145 150 155 160 

Asn Ala Arg; Thr Val Leu Lys Gly Gin Gly Ser He Lys Asp Pro He 
165 170 175 

Cys Phe Val Phe Tvr Leu Asn Trp Ser He Pro Lys Val Asn Asn Thr 
180 " 185 190 

Ser Glu Asn Cys Val Gin Leu His Leu Leu Cys Asp Gin Val Tyr Lys 
195 200 205 

Lvs Asp Val Ser Phe Ala Ser Val Met Tvr Ser Trp Thr Lys Glu Phe 
210 215 220 

Cys Asp Ser Pro Arg Ala Asp Leu Asp Lys Ser Cys Met He He Pro 
225 230 235 240 

He Asn Arg Ala He Arg Ala Lys Ser Gin Ala Phe He Glu Ala Cys 
245 250 255 



Lvs Leu He He Pro Lys Gly Asn Ser Glu Lys Gin He Arg Arg Gin 
260 265 270 

Leu Ala Glu Leu Ser Ala Asn Leu Glu Lys Ser Val Glu Glu Glu Glu 
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275 2SC 



285 



Asn Val Thr Asp Asn Lys lie Glu He Ser Phe Aso Asn Glu n e 

255 

) INFORMATION FOR SEQ ID NO : 2 5 : 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 2 62 a-.ino'acids 

(B) TYPE: amino acid 

(C) STRANDEDNSSS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Asn Lys Ala Lys He Thr Lys Glu Asn He Val Lys Leu Leu Thr 
x 5 io 



15 



Gin Ser Asp Ser Leu Glu Phe Glu Glu Thr Gin Asn Glu Gly Ser Phe 
20 2 5 30 

Asn Phe Thr Asp Phe Phe Thr Asn Asn Arg Glu Lys He Gin Asn Met 

J J A Pi „ _ 



35 40 

Thr Thr Ala Ser Cys Leu Ser Phe Leu Lys Asn Arg Gin Ser lie Met 
50 55 60 

Arg val lie Lys ser Ala Asp Phe Thr Phe Gly Ser Val Thr lie Lys 
65 70 "75 80 

Lys Thr Arg Asn Asn Ser Glu Arg Val Gly Val Asn Asp Met Thr Phe 
85 go g5 

Arg Arg Leu Asp Ala Met Val Arg Val His Leu Val Gly Met He Lys 
100 105 110 

Asp asn Gly Ser Ala Leu Thr Glu Ala He Asn Ser Leu Pro Ser His 
115 120 125 

Pro Leu He Ala Ser Tyr Gly Leu Ala Thr Thr Asn Leu Lys Ser Cys 
XJU 135 140 

Val Leu Gly Val Leu Leu Gly Gly Ser Leu Pro Leu He Ala Ser Val 
145 150 155 i 6 o 

Leu Asn Phe Glu lie Ala Ala Leu val Pro Ala He Tyr Gin Aso Ala 

170 i-jg 

Lys His Val Glu Leu Gly He As? Met Ser Lys Phe Ser Thr Lys Glu 
180 185 igo 

Ala Val Sly Lys Val Cys Thr Val Leu Lys Ser Lys Gly Tyr Ser Met 
-9a 200 205 
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10 



15 



40 



45 



Asn Ser val Glu lie Gly Lys Ala Lys Gin Tyr Ala Asp lie Leu Lys 
210 215 220 

Ala Cys Ser Pro Lys Ala Lvs Glv Leu Ala Ala Met Asp nis Tyr Lvs 
225 230 "* " 235 240 

Glu Glv Leu Thr Ser lie Tyr Ser Met Phe Asn Ala Thr He Asp Phe 
245 250 255 

Gly Lys Asn Asp Ser He 
260 

(2) INFORMATION FOR SEQ ID NO : 2 6 : 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 449 amino acids 
(5) TYPE: amino acid 
(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: pro- ein 

{iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

25 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: IMPATIENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

30 Met Ser ser Ala Met. Tyr Glu Thr lie He Lys Ser Lys Ser Ser He 

1 5 10 15 

Trp Glv Thr Thr Ser Ser Gly Lys Ala Val Val Asp Ser Tyr Trp He 
20 25 30 

His Asp Gin Ser Ser Gly Lys Lys Leu Val Glu Ala Gin Leu Tyr Ser 
35 3 5 4 0 4 5 

Asp Ser Arg Ser Lys Thr Ser Phe Cys Tyr Thr Gly Lys Val Gly Phe 
50 55 60 



Leu Pro Thr Glu Glu Lys Glu lie lie Val Arg Cys Phe Val Pro lie 
65 70 75 80 

Phe Asp Asn He Asp Leu Asn Phe Ser Phe Ser Gly Asr. Val Val Glu 
85 90 95 

He Leu Val Arg Ser Asn Thr Thr Asn Thr Asn Gly Val Lys His Gin 
100 105 HO 

Gly His Leu Lys Val Leu Ser Ser Gin Leu Leu Arg Met Leu Glu Glu 
115 120 125 

Gin He Ala Val Pro Glu He Thr Ser Arg Phe Gly Leu Lys Glu Ser 
130 135 140 

50 Aso He Phe Pro Pro Asn Asn Phe He Glu Ala Ala Asn Lys Gly Ser 

145 150 155 160- 

Leu Ser Cys Val Lys Glu Val Leu Phe Asp Val Lys Tyr Ser Asn Asn 
165 170 175 

55 
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Gin Ser Met Gly Lys Val Ser Val Leu Ser Pre Thr Arg Ser Val His 
180 185 i9 C 

Glu Trp Leu Tyr Thr Leu Lys ?rc Val ?he Asn Gin Ser Gin ?sn 
y ° 2CC 205 

Asn Arg Thr Val Asn Thr Leu Ala Val Lys Ser Leu Ala Met S— *la 

215 220 

Thr Ser Asp Leu Met Ser Asp Thr His Ser ?he Val Arg Leu Asn fsn 

23C 235 2*40 

Asn Lys Pro Phe Lys lie Ser Leu Trp Met Arg He Pro Lys He Met 
245 25C 255 

Lys Ser Asn Thr Tyr Ser Arg Phe Phe Thr Leu Ser Asp Glu Ser Se- 
2S0 265 * 270 

Pro Lys Glu Tyr Tyr He Ser Zle Gin Cys Leu Pro Asn His Asn Asn 
275 280 285 

Val Glu Thr Val lie Glu Tyr Asn Phe Asp Gin Ser Asn Leu Phe Leu 
"° 295 300 

Asn Gin Leu Leu Leu Ala Val He His Lys He Glu Met Asn Phe Ser 
305 310 315 320 

Asp Leu Lys Glu Pro Tyr Asn Val He His Asp Met Ser Tyr Pro Gin 
325 330 335 

Arg He Val His Ser Leu Leu Glu He His Thr Glu Leu Ala Gin Tb- 
340 345 350 

Val Cys Asp Ser Val Gin Gin Asp Met He Val Phe Thr He Asn Glu 
355 360 3 6 5 

Pro Asp Leu Lys Pro Lys Lys Phe Glu Leu Gly Lys Lys Thr Leu Asn 

375 380 

Tyr Ser Glu Asp Gly Tyr Gly Arg Lys Tyr Phe Leu Ser Gin Thr Leu 
385 390 395 400 

Lys Ser Leu Pro Arg Asn Ser Gin Thr Met Ser Tyr Leu Asp Ser Xle 

405 410 415 

Gin v etl Pro Trp Lys phe Asp Tyr ftla A1& Q ^ ^ 

420 425 



Ser Pro Arg Ser Glu Asp Val Leu Lys Ala He Ser Lys Leu Asp Leu 
435 440 



430 

445 



Asn 



INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
O) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



46 



EP 0 566 525 A2 



(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: I MP AT I ENS NECROTIC SPOT VIRUS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Me- Lys lie Leu Lys Met Cys Glu Leu Leu Val Lys lie Ser Val Cys 
15 10 15 

Thr Leu Val Val Thr Ser Val He Leu Ser ?he Met Ala Leu Lys Glu 
20 25 30 

Thr Aso Ala Lys He His Val Glu Arg Gly Asp His Pro Glu He Tyr 
35 40 45 

Asp Glu Ala Tyr Tyr Asp Arg Ser Val Asp His Lys Asn Glu He Leu 
50 55 60 

Asp Thr Leu Ala Glu Met Leu Gin Asn Ala Thr Gly Lys Thr Leu Arg 
65 70 75 80 

Pro Thr Arg Asd Thr Gin Thr Val Leu Ala Asn Asn Glu Val Pro Gin 
85 90 95 

Ser Pro Ser Gly Leu Ser Ser Thr Pro Thr Thr He Ser Val Met Aso 
100 105 110 

Leu Pro Asn Pro Cys Leu Asn Ala Ser Ser Leu Thr Cys Ser lie Lys 
115 120 125 

Glv Val Ser Thr Phe Asn Val Tyr Tyr Gin Val Glu Ser Asn Gly Val 
130 135 140 

He Tyr Ser Cys He Ser Asp Thr He Thr Lys Leu Gly Asn Cys Glu 
145 150 155 160 

Glv Ser Ser Glu Leu Pro Arg Ser Phe Glu Thr Val Pro Val Val Pro 
165 170 175 

He Thr Lys He Asn Asn Lys Arg Lys Leu Ser He Gly Thr Lys Phe 
180 ' 185 190 

Tvr He He Glu Ser Leu Glu Asn Tyr Asn Tyr Pro He Met Tyr Asn 
195 200 205 

Ser Arg Pro Thr Asn Gly Thr Val Ser Leu Gin Ser Val Lys Phe Ser 
210 215 220 

Gly Asp Cys Lys He Ser Lys Thr Asn He Val Asn Ser Tyr Thr Val 
225 230 235 240 

Ser Leu Thr Thr Pro Glu Lys He Met Gly Tyr Val Val Lys Arg Glu 
245 250 255 

Gly Ser Asp Met Ser His Ser He He Ser Phe Ser Gly Ser Val Ser 
260 , 265 270 

Leu Thr Phe Thr Glu Glu Asn Met Asd Gly Lys His Asn Leu Leu Cys 
275 280 285 
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Gly Asp Lys Ser Ser Lys Val P ro Leu val Asp Lys Arg Val Arg Asp 

Cys lie He Lys Tyr Ser Lys Asn He Tyr Lys Gin Thr Ala Cys lie 

J1 ° 315 320 
Asn Phe Ser Trp Phe Arg Leu He v e - Ile Ala Leu 

325 330 3 Y 3 l " 

Pro lie Arg Tyr Leu Val Asn Lys Thr Ser Lys Thr Leu Phe Tyr C^ y 

345 35 0 

Tyr Asp Leu Leu Gly Leu He Thr Tyr Pro Ile Leu Leu Leu r e ^ 

->60 365 

Tyr Leu Trp Ser Tyr Phe Pro Leu Lys Cys Lys Val Cys Gly Asn Leu 
w 375 380 

Hi VSl Thr HiS f& C - vs Ser Lys Leu Cys lie Cys Asn Lys Asn 

J3U 395 400 

Lys Ala Ser Glu Glu Kis Ser Glu Glu Cys Pro lie lie Thr Arg Tt- 
405 410 41 | ' 

Ala Glu Lys Asn Lys Lys Tyr Asn Trp Ala Ser lie Glu Trp Phe His 
H * J 425 43 0 

Leu Ile Val Asn Thr Lys He Gly Leu Ser Phe Leu Lys Ala Val Thr 

^ J;> 440 445 

Glu Thr Leu lie Gly Phe Leu He Leu Ser Gin Met Pro Met Ser Met 
■ 3U 455 460 

Ala Gin Thr Ala Gin Cys Leu Asp Ser Cys Tyr Tyr Val Pro Gly Cys 
" 470 . 475 * 4I0 

Asp Arg Phe Val Thr Asn Arg Tyr Asp Lys ; Cys Pro Glu Lys Asp Gin 
485 490 - ~ - 



495 



Cys Phe Cys Ala lie Lys Glu Asn Ser lie Val Glu Ser Asn Phe Leu 
ouu 505 510 

Thr Asn Val Val Thr Glu Gly Pro Met Asp Cys lie Pro Tyr Gin G'u 
315 520 5 25 

Cys Lys Gly Arg He Thr Glu Asn Ala Leu Val Thr Phe Val Lys Cys 
OJU 535 540 

Ar| Phe Gly Cys Glu Tyr Ala Ser lie Phe Gin Ser Lys Pro Leu Asp 

5SU 555 560 

Asn Gly Phe Leu Glu Tyr Ser Gly Asp Thr Leu Gly Leu Asn Ala Val 
565 570 5?5 

Asn Leu His Phe Met Lys Arg Leu Arg Asn Gly Ile Ile Asp Phe Tyr 
30U 585 590 

Asn Lys Thr Glu Lys Tyr Gly Tyr He Ser Gly Asp Ala Leu Lys Ser 
- 95 600 60S 

Asn C-lu Ser Asp He Pro Glu Ser lie Phe Pro Arg Lys Ser Leu He 
°- J 615 620 

Phe Asp Ser Val He Asp C-ly Lys Tyr Arg Tyr Met Ile Glu Glu Ser 



48 



EP 0 566 525 A2 



625 630 635 640 

Leu Leu Ser Glv Glv Gly Thr Val Phe Ser Leu Asn Asp Lvs Ser Ser 
645 650 ' 655 

Ser Thr Ala Gin Lys Phe Val Val Tyr lie Lys Lys Val Arg lie Gin 
66C 665 670 

Tyr Asp Val Ser Glu Gin Tyr Thr Thr Ala Pro He Gin Ser Thr Kis 
675 680 685 

Thr Asp Phe Phe Ser Thr Cys Thr Gly Lys Cys Ser Asp Cys Arg Lys 
690 695 700 

Glu Gin Pre He Thr Gly Tyr Gin Asp Phe Cys lie Thr Pro Thr Ser 
705 710 715 720 

Tvr Trp Glv Cys Glu Glu Val Trp Cys Leu Ala He Asn Glu Gly Ala 
725 730 735 

Thr Cys Gly Phe Cys Arg Asn Val Tyr Asp Met Asp Gin Ser Phe Arg 
740 745 750 

He Tyr Ser Val He Lys Ser Thr He Lys Ser Glu Val Cys He Ser 
755 760 765 

Gly Phe Val Gly Ala Lys Cys Phe Thr Val Ser Glu Glu Val Pro Ser 
770 775 780 

Glu Ser Glv Tyr Phe Gin Ala Asp He Leu Ala Asp Phe His Asn Asp 
785 " 790 795 800 

Gly Leu Thr He Gly Gin Leu He Ala His Gly Pro Asp Ser Kis Val 
805 810 815 

Tyr Ala Gly Asn He Ala Arg Leu Asn Asn Pro Ser Lys Met Phe Gly 
820 825 830 

His Pro Gin Leu Ser His Gin Gly Asp Pro He Phe Ser Lys Lys Thr 
835 840 845 

Leu Asp Thr Asn Asp Leu Ser Trp Asp Cys Ser Ala He Gly Lys Lys 
850 855 860 

Thr He Thr He Lys Ser Cvs Gly Tyr Asp Thr Tyr Arg Phe Lys Thr 
865 870 " 875 880 

Gly Leu Asn Gin He Ser Asp He Pro Val Gin Phe Thr Asp Gin Asn 
885 890 895 

Ser Phe Tyr Met Glu Lys He Phe Ser Leu Gly Lys Leu Lys He Val 
900 905 910 

Leu Asp Leu Pro Ser Glu Leu Phe Lys Thr Val Pro Lys Lys Pro He 
915 920 925 

Leu Ser Ser Val Ser Leu Ser Cvs Lys Gly Cys Phe Leu Cys Ser Gin 
930 935 940 

Gly Leu Arg Cys Ala Ala Ser Phe He Ser Asp He Thr Phe Ser Ala 
S45 950 955 960 

Arg Leu Thr Met Lys Gin Cvs Ser Leu Ser Thr Tyr Gin He Ala Val 
965 - 970 975 
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Lys Lys Gly Ala Asn Lys Tyr Asn Leu Thr Met Phe Cys Thr Ser Asn 



9S0 

Pre Glu Lys Gin Lys Met He He Glu Pro Glu Gl. 

100 



„ ec AAe IAe Q Glu Pro Glu Gly AspLys Ser Tyr 



Ser yel Glu Ala Lea Val As? Ser Val Ala Val Leu Glu P rc Glu Asn 

1015 1020 

Ile_Ile Asp Gin Asn As^Gln His Ala His Glu Glu Gin Gin Tyr Asn 

1035 1040 

Ser Asp Thr Ser Val Trp Ser Phe Trp Asp Tyr Val Lys Ser Pro Phe 

^° 45 1°5C 1C55 

Asn Phe lie Al^Ser Kis Phe Gly Serine Phe Asp Thr Val Arg Val 

Val Leu Leu He Leu Phe Val Phe Ala Leu Ala Tyr Leu Cys Ser He 

1Q3G 1085 
val Al^Thr Met Cys Arg GlyTyr Val Arg Asn Ly, S« Tyr Lys Thr 



Lys^yr lie Glu Asp Thr Asa Asp Tyr Ser Leu Val S er Thr Ser Ser 

1115 112 

C-ly Lys Asp Thr lie Thr Arg Arg Arg Pro Pro Leu Asp Phe Ser Gly 

1125 1130 * 
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Claims 



1. Recombinant INSV DMA constructs comprising a DNA sequence coding for transcription into 

a) an RNA sequence of an INSV or an RNA sequence homologous thereto • 

b) an RNA sequence of an INSV or an RNAsequence homologous thereto capable of encoding for an INSV 
prote.n or a part thereof , in which one or more codons have been replaced by synonyms , or an RNA se- 
quence homologous thereto ; or 

c) an RNA sequence complementary to an RNA sequence according to a) or b) 

Wh,t ; h a I DNA iS L UndSr expression contral of a P ra ™te' and a terminator capable of functioning in plants 
2 A DNA construct according to Claim 1 , wherein the INSV DNA sequences code for transcription into: 

i) the viral S RNA nucleotide sequence from 1 to 3017 (SEQ. ID No.1) 

ii) the viral S RNA nucleotide sequence from position 25 to 3017 (SEQ. ID No.2); 

iii) the viral S RNA nucleotide sequence from 87 to 1436 (SEQ. ID No.3); 
iy) the viral S RNA nucleotide sequence from 2080 to 2868 (SEQ. ID No.4); 

v) the viral S RNA " pan-handle " structure comprising : 

viral f SRNA de0tide S6qUenCe ° f fr ° m abOUt 30 1 ° about 36 nuc| eotides in length from the 5' end of the 
and 

b) a second nucleotide sequence of from about 30 to about 36 nucleotides in length from the 3' end of 

the viral S RNA 

vi) the viral S RNA nucleotide sequence from 1437 to 2079; (SEQ ID No. 7) 

vii) the viral S RNA nucleotide sequence from 1440 to 2041; (SEQ ID No.8) 

viii) the viral complementary S RNA nucleotide sequence from 1 to about 3017; (SEQ ID No 9) 

ix) the viral compl mentary S RNA nucleotide sequence from 1 to 2993; (SEQ ID No 10) 

x) the viral complementary S RNA nucleotide sequence from 150 to 938; (SEQ ID No 11) 

xi) the S RNA nucleotide sequence from 1581 to 2930 of the viral complementary S RNA strand" (SEQ ID 
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No.12); 

xii) the viral complementary S RNA secondary structure having a nucleotide sequence of 642 nucleotides 
from 939 to 1580; (SEQ ID No.13) 

xiii) S RNA nucleotide sequence from 87 to 1436 in which one or more codons have been replaced by their 
synonyms; 

xiv) S RNA nucleotide sequence from 2080 to 2868 in which one or more codons have been replaced by 
their synonyms; 

xv) the M RNA nucleotide sequence from 1 to 4970 (SEQ ID No.14); 

xvi) the M RNA sequence from 86 to 997 (SEQ ID No.15); 

xvii) the M RNA sequence of the intergenic region from 998 to 1470 (SEQ ID No. 16); 

xviii) the M RNA sequence from 1471 to 4884; (SEQ ID No. 17) 

xix) the M RNA "pan-handle" structure comprising : a) a first nucleotide sequence of from about 30 to about 
36 nucleotides in length from the 5* end of the viral M RNA 

and 

b) a second nucleotide sequence of from about 30 to about 36 nucleotides in length from the 3' end of the 
viral M RNA; 

xx) the complementary viral M RNA sequence from 1 to 4970; (SEQ ID No.20) 

xxi) the complementary viral M RNA sequence from position 87 to position 3500 of the complementary 
viral M RNA sequence; ( SEQ ID No.21) 

xxii) the complementary viral M RNA sequence from position 3974 to 4885 (SEQ ID No.22) 

xxiii) RNA sequences homologous to the nucleotide sequences defined under i) to xii) and xv) to xxii) here- 
inabove. 

xxiv) fragments of sequences defined under i) to xxii) hereinabove. 

3. A DNA construct according to Claim 1, wherein the DNA sequence codes for transcription into INSV- 
RNA sequences of a pan-handle, or into RNA sequences homologous thereto. 

4. A DNA construct according to Claim 1 wherein the DNA sequence codes for transcription into INSV-RNA 
sequences of a pan-handle wherein the pan-handle structure comprises two complementary strands compris- 
ing 36 nucleotides in length. 

5. A DNA construct according to Claim 1, wherein the DNA sequence codes for transcription into INSV RNA 
sequences of an open reading frame in viral complementary sense, or into corresponding RNA sequences in 
which one or more codons have been replaced by synonyms thereof, or into RNA sequences homologous 
thereto. 

6. A DNA construct according to Claim 1 , wherein the the DNA sequence codes for transcription into INSV- 
RNA sequences of a secondary structure, or into RNA sequences homologous thereto. 

7. A DNA construct according to Claim 1 wherein the DNA sequence codes for transcription into INSV-RNA 
sequences, or into INSV-RNA sequences in which one or more codons have been replaced by synonyms there- 
of, or into RNA sequences homologous thereto of at least 15 nucleotides. 

8. A DNA construct according to Claim 6, wherein the DNA sequence codes for transcription into INSV- 
RNAsequences in which one or more codons have been replaced by synonyms thereof, or into RNAsequences 
homologous thereto of at least 400 nucleotides. 

9. A DNA construct according to Claim 1 , wherein the DNA sequence codes for transcription into a com- 
bination of the 5' and 3' terminal sequences of the viral S, M or L RNA respectively. 

1 0. A DNA construct according to Claim 1 , wherein the promoter is a viral, fungal, bacterial, animal or plant- 
derived promoter capable of functioning in plant cells. 

11 A DNA construct according to Claim 9, wherein the terminator is a viral fungal, bacterial, animal or plant- 
derived terminator capable of functioning in plant cells. 

12 A plant comprising in its genome a DNA construct in accordance with Claim 1. 

13 A probe comprising a single or double stranded oligonucleotide sequence complementary to an RNA 
sequence of an INSV. 

14 A probe according to Claim 13, wherein the oligonucleotides are complementary to an INSV S RNA 
sequence, an INSV M RNA nucleotide sequence or to fragments of such sequences comprising at least 12 
nucleotides. 

15 A probe according to Claim 12 or Claim 13, wherein the oligonucleotide sequence has from 12 to 800 
nucleotides. 

16 A process of preparing plants according to Claim 11, which comprises 

a) inserting into the genome of a plant cell a DNA construct of Claim 1 , 

b) obtaining transformed ceils; 

c) regeneration from the transformed cells genetically transform d plants. 
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Recombinant Impatiens Necrotic Spot Virus 
(INSV) DNA constructs comprising an INSV DNA 
coding for transcription into INSV RNA sequ- 
ences or into RNA sequences related thereto, 
the use of such DNA constructs to transform 
plants having reduced susceptibility to INSV 
infection and probes for the isolation of INSV or 
diagnosis of plant INSV related diseases. 
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